<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>News and insights Archives - TantusData</title>
	<atom:link href="https://tantusdata.com/insights-categories/news_insights/feed/" rel="self" type="application/rss+xml" />
	<link>https://tantusdata.com/insights-categories/news_insights/</link>
	<description>That uncovers wisdom.</description>
	<lastBuildDate>Wed, 09 Apr 2025 13:19:21 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.7.1</generator>

<image>
	<url>https://tantusdata.com/app/uploads/2023/01/cropped-Favicon-32x32.png</url>
	<title>News and insights Archives - TantusData</title>
	<link>https://tantusdata.com/insights-categories/news_insights/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Vendor lock-in when selecting a Cloud Data Platform architecture.</title>
		<link>https://tantusdata.com/insights/vendor-lock-in-when-selecting-a-cloud-data-platform-architecture/</link>
		
		<dc:creator><![CDATA[Marcin Szymaniuk]]></dc:creator>
		<pubDate>Wed, 09 Apr 2025 13:13:19 +0000</pubDate>
				<category><![CDATA[CloudMigration]]></category>
		<category><![CDATA[CloudStrategy]]></category>
		<category><![CDATA[DataWarehouse]]></category>
		<category><![CDATA[VendorLockIn]]></category>
		<guid isPermaLink="false">https://tantusdata.com/?post_type=insights&#038;p=2189</guid>

					<description><![CDATA[<p>Cloud Data Platform migrations come with hidden exit costs. Learn how to reduce vendor lock-in risk through smart architecture and technology choices.</p>
<p>The post <a href="https://tantusdata.com/insights/vendor-lock-in-when-selecting-a-cloud-data-platform-architecture/">Vendor lock-in when selecting a Cloud Data Platform architecture.</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<figure class="wp-block-image size-full"><img fetchpriority="high" decoding="async" width="960" height="540" src="https://tantusdata.com/app/uploads/2025/04/Grafiki-2.jpg" alt="" class="wp-image-2190" srcset="https://tantusdata.com/app/uploads/2025/04/Grafiki-2.jpg 960w, https://tantusdata.com/app/uploads/2025/04/Grafiki-2-300x169.jpg 300w, https://tantusdata.com/app/uploads/2025/04/Grafiki-2-768x432.jpg 768w" sizes="(max-width: 960px) 100vw, 960px" /></figure>






<h2 class="wp-block-heading">Migrating a data platform, data warehouse, or data lake?</h2>



<p>When deciding to move their data to the cloud, many people focus on the expected gains: lower costs, easier maintenance, or simplified development. However, it’s crucial to also consider the potential cost of exiting the cloud. What happens if, at some point, you decide you no longer want to be on a particular cloud platform? This could happen due to rising costs, new technology options, or even legal or political reasons.</p>



<p>Did you know that moving your data platform off a cloud can be a more expensive project than the original migration to that cloud?<br><br>If you plan to migrate your data platform to the cloud, thinking about future exit costs now is a sign of a responsible migration. Understand what the cost will be in dollars, time, and effort if at some point you decide to leave that specific cloud vendor. This means recognizing that exit costs aren’t always obvious and that you’ll need to consider the various aspects of vendor lock-in.<br></p>



<h2 class="wp-block-heading">What exactly are the risks associated with vendor lock-in?</h2>



<ul class="wp-block-list">
<li><strong>Rising long-term costs</strong>. If you don’t have an alternative that is easy to move to, you risk becoming a hostage of your vendor.</li>



<li><strong>Lack of flexibility</strong> &#8211; even if your solution is great now, there is always a risk that it will no longer be developed in the future.&nbsp;</li>



<li><strong>High transfer fees</strong>. Most cloud providers charge you for transferring data out of their platform (egress fees). These fees need to be calculated when planning a migration away from a specific cloud vendor.</li>



<li><strong>Contractual agreements</strong> &#8211; vendor lock-in is not only about proprietary technology or data formats. The terms of your contract can be another trap that becomes painful in the future.</li>



<li><strong>Lost optimization opportunities &#8211; </strong>there is no such thing as a free lunch. Cloud solutions are usually easier to start with, but your engineers might lose the ability to do low-level tuning if you ever need it.</li>
</ul>



<p><mark style="background-color:rgba(0, 0, 0, 0)" class="has-inline-color has-black-color">When planning your cloud migration, don’t overlook the long-term exit costs &#8211; ask your provider about the compatibility of specific technologies with other tools on the market.&nbsp;</mark><br></p>



<h2 class="wp-block-heading">What can you do to mitigate the risk?</h2>



<ul class="wp-block-list">
<li>Evaluate the technology and its alternatives – consider whether there are open-source alternatives to your chosen data warehousing solution and whether it&#8217;s compatible with other vendors.</li>



<li>If you decide on proprietary technology, make sure the cost of moving in and the expected exit cost are justified by what you gain (usually easier, faster development).</li>



<li>Consider a hybrid cloud. It’s more expensive but gives you more flexibility in the long run.</li>



<li>Consider favouring well-known standards and open-source technologies and data formats. A good example is Kubernetes &#8211; it’s a technology which you can run on your own servers as well as with any major cloud provider.</li>
</ul>



<p>If you are considering moving to a cloud provider, ping me on LinkedIn (<a href="https://www.linkedin.com/in/marcin-szymaniuk/">https://www.linkedin.com/in/marcin-szymaniuk/</a>) for specific calculations.</p>



<h2 class="wp-block-heading">Summary</h2>



<p>The data space is moving fast.&nbsp;</p>



<p>Always ask yourself how likely it is that you will have to migrate again within five years.</p>



<p>Always ask yourself what will happen if you can’t use the selected data platform anymore. How much notice would you need to migrate to another solution?&nbsp;</p>



]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Unleashing Innovation: A Glimpse into Our Exciting Event Journey</title>
		<link>https://tantusdata.com/insights/unleashing-innovation-event-journey/</link>
		
		<dc:creator><![CDATA[Magdalena Majka]]></dc:creator>
		<pubDate>Tue, 04 Jun 2024 09:52:16 +0000</pubDate>
				<category><![CDATA[learning]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[SPARK]]></category>
		<guid isPermaLink="false">https://tantusdata.com/?post_type=insights&#038;p=1989</guid>

					<description><![CDATA[<p>Whether you&#8217;ve joined us in the past or are planning to attend our upcoming events, there&#8217;s always something exciting on the horizon. Let&#8217;s take a look at where we&#8217;ve been and where we&#8217;re headed next, so that you can gain valuable insights from attending our speeches, participating in our workshops, or exploring our other content. [&#8230;]</p>
<p>The post <a href="https://tantusdata.com/insights/unleashing-innovation-event-journey/">Unleashing Innovation: A Glimpse into Our Exciting Event Journey</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<figure class="wp-block-image size-large"><img decoding="async" width="1024" height="683" src="https://tantusdata.com/app/uploads/2024/05/intro-1024x683.jpg" alt="" class="wp-image-2001" srcset="https://tantusdata.com/app/uploads/2024/05/intro-1024x683.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/intro-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/intro-768x512.jpg 768w, https://tantusdata.com/app/uploads/2024/05/intro-1536x1024.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/intro.jpg 1920w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure>



<p>Whether you&#8217;ve joined us in the past or are planning to attend our upcoming events, there&#8217;s always something exciting on the horizon. Let&#8217;s take a look at where we&#8217;ve been and where we&#8217;re headed next, so that you can gain valuable insights from attending our talks, participating in our workshops, or exploring our other content. This season, our focus is on Large Language Models (LLMs) and Apache Spark, offering you a wealth of knowledge and practical skills to enhance your expertise. There&#8217;s a lot you can gain from engaging with our content and events, so don&#8217;t miss out!</p>



<h2 class="wp-block-heading">Past Events: Highlights and Memories &#8211; Top Learning Places to Keep in Your Calendar for Next Year</h2>



<figure class="wp-block-image size-large"><img decoding="async" width="1024" height="684" src="https://tantusdata.com/app/uploads/2024/05/2-1024x684.jpg" alt="" class="wp-image-1996" srcset="https://tantusdata.com/app/uploads/2024/05/2-1024x684.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/2-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/2-768x513.jpg 768w, https://tantusdata.com/app/uploads/2024/05/2-1536x1025.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/2-2048x1367.jpg 2048w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure>



<p><strong>Big Data Europe, November 2023</strong></p>



<p>We were honored to host two insightful workshops: &#8220;ChatGPT, LLMs, and LangChains&#8221; and &#8220;Apache Spark Performance Tuning&#8221;. The event was a phenomenal success, and it was great meeting everyone there. If you missed it or want to relive the experience, check out our exclusive videos <a href="https://bigdataconference.eu/?gad_source=1&amp;gclid=Cj0KCQjwgJyyBhCGARIsAK8LVLMHCCVIL8-3q_soH7PLrH2oDLc71BOytkbqzhXRwlwD1X1ljl8uJZgaAoh8EALw_wcB">here</a>.</p>



<figure class="wp-block-gallery has-nested-images columns-default is-cropped wp-block-gallery-1 is-layout-flex wp-block-gallery-is-layout-flex">
<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" data-id="1999" src="https://tantusdata.com/app/uploads/2024/05/1-1024x683.jpg" alt="" class="wp-image-1999" srcset="https://tantusdata.com/app/uploads/2024/05/1-1024x683.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/1-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/1-768x512.jpg 768w, https://tantusdata.com/app/uploads/2024/05/1-1536x1024.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/1-2048x1365.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</figure>



<p><strong>SMG Data Summit, January 23-24, 2024</strong></p>



<p>This two-day event, hosted by Google and organized by SMG Swiss Marketplace Group, was a deep dive into the world of data analytics and innovation. We led two workshops: &#8220;ChatGPT, LLMs, and LangChains&#8221; and &#8220;Deep Dive into AI &amp; ML for CxOs, Managers, and Business Leaders&#8221;. It was a fantastic opportunity to enhance our data expertise. More details can be found <a href="https://swissmarketplace.group/data-summit/">here</a>.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="684" src="https://tantusdata.com/app/uploads/2024/05/6-1024x684.jpg" alt="" class="wp-image-2003" srcset="https://tantusdata.com/app/uploads/2024/05/6-1024x684.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/6-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/6-768x513.jpg 768w, https://tantusdata.com/app/uploads/2024/05/6-1536x1025.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/6-2048x1367.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p><strong>Warsaw IT Days, April 5-6, 2024</strong></p>



<p>Celebrating its 15th anniversary, this iconic event gathered over 10,000 IT and Data Science enthusiasts. Our talks, &#8220;AI Chats &#8211; What Nobody Told You: The Conundrums of Business Integration&#8221; and &#8220;Optimising Apache Spark and SQL for Improved Performance,&#8221; were well received. Discover more about this event <a href="https://warszawskiedniinformatyki.pl/en/">here</a>.</p>



<p>If you missed any of these events, there&#8217;s still time to sign up for our upcoming ones or to look out for their next editions. We also invite you to explore the wealth of content already available on our blog and YouTube channel. Dive into our extensive library of articles, videos, and tutorials to stay updated and inspired.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://tantusdata.com/app/uploads/2024/05/kreatyw-media_004-1-1024x683.jpg" alt="" class="wp-image-2005" srcset="https://tantusdata.com/app/uploads/2024/05/kreatyw-media_004-1-1024x683.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/kreatyw-media_004-1-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/kreatyw-media_004-1-768x512.jpg 768w, https://tantusdata.com/app/uploads/2024/05/kreatyw-media_004-1-1536x1024.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/kreatyw-media_004-1-2048x1365.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p><strong>Conf42 LLM, April 11, 2024</strong></p>



<p>Marcin Szymaniuk will be diving into the complexities of integrating AI like ChatGPT into business frameworks at Conf42 LLMs. Learn how to optimize your resources for successful AI adoption. Don&#8217;t miss out – details <a href="https://lnkd.in/dF2f3m-t">here</a>.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://tantusdata.com/app/uploads/2024/05/screen-1024x683.jpg" alt="" class="wp-image-2007" srcset="https://tantusdata.com/app/uploads/2024/05/screen-1024x683.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/screen-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/screen-768x512.jpg 768w, https://tantusdata.com/app/uploads/2024/05/screen-1536x1024.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/screen-2048x1365.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p><strong>Big Data Technology Warsaw Summit, April 24-25, 2024</strong></p>



<p>Join us for technical presentations, interactive roundtables, and networking opportunities with over 600 attendees. We&#8217;ll be leading a roundtable on &#8220;What Product Owners and Managers Should Know About ML and LLMs: The Challenges of Business Integration&#8221;. Find out more <a href="https://bigdatatechwarsaw.eu/">here</a>.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://tantusdata.com/app/uploads/2024/05/4-1024x683.jpg" alt="" class="wp-image-2015" srcset="https://tantusdata.com/app/uploads/2024/05/4-1024x683.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/4-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/4-768x513.jpg 768w, https://tantusdata.com/app/uploads/2024/05/4-1536x1025.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/4-2048x1367.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p><strong>Data Analytics Meeting, May 17-18, 2024</strong></p>



<p>Our keynote, &#8220;A Short History of Data &#8211; Where Are We Aiming with AI?&#8221; will explore the evolution and future of data. This conference supports student development through discussions and networking. More information is available <a href="https://event.mostwiedzy.pl/event/55/page/398-o-konferencji">here</a>.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://tantusdata.com/app/uploads/2024/05/wirkshop1-1-1024x683.jpg" alt="" class="wp-image-2011" srcset="https://tantusdata.com/app/uploads/2024/05/wirkshop1-1-1024x683.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/wirkshop1-1-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/wirkshop1-1-768x512.jpg 768w, https://tantusdata.com/app/uploads/2024/05/wirkshop1-1-1536x1024.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/wirkshop1-1.jpg 1920w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p><strong>Infoshare, May 22-23, 2024</strong></p>



<p>The biggest tech and startup event in CEE, bringing together thousands of enthusiasts. We&#8217;ll be discussing &#8220;Optimising Apache Spark and SQL&#8221;. Don&#8217;t miss out – sign up <a href="https://infoshare.pl/conference/speakers/#speaker2010">here</a>.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://tantusdata.com/app/uploads/2024/05/wirkshop2-1024x683.jpg" alt="" class="wp-image-2013" srcset="https://tantusdata.com/app/uploads/2024/05/wirkshop2-1024x683.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/wirkshop2-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/wirkshop2-768x512.jpg 768w, https://tantusdata.com/app/uploads/2024/05/wirkshop2-1536x1024.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/wirkshop2.jpg 1920w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p><strong>Jfokus, May 28, 2024</strong></p>



<p>Join us at the Jfokus Training Camp in Stockholm for a focused one-day workshop on &#8220;LLMs and LangChains&#8221;. This is a great chance for hands-on learning and networking. Secure your spot <a href="https://www.jfokus.se/trainingday">here</a>.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://tantusdata.com/app/uploads/2024/05/wyklad-1024x683.jpg" alt="" class="wp-image-2020" srcset="https://tantusdata.com/app/uploads/2024/05/wyklad-1024x683.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/wyklad-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/wyklad-768x512.jpg 768w, https://tantusdata.com/app/uploads/2024/05/wyklad-1536x1024.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/wyklad.jpg 1920w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p><strong>Conf42 Machine Learning, May 30, 2024</strong></p>



<p>Marcin will share insights on Apache Spark SQL at this prestigious event. Check out the details <a href="https://www.conf42.com/ml2024">here</a>.</p>



<h2 class="wp-block-heading">Upcoming Events: Join Us and Stay Ahead</h2>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="684" src="https://tantusdata.com/app/uploads/2024/05/1-2-1024x684.jpg" alt="" class="wp-image-2022" srcset="https://tantusdata.com/app/uploads/2024/05/1-2-1024x684.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/1-2-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/1-2-768x513.jpg 768w, https://tantusdata.com/app/uploads/2024/05/1-2-1536x1025.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/1-2-2048x1367.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p><strong>Voxxed Days Luxembourg, June 20-21, 2024</strong></p>



<p>We&#8217;re excited to deliver an LLM workshop at this developer-focused event. Learn more and register <a href="https://luxembourg.voxxeddays.com/en/">here</a>.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="684" src="https://tantusdata.com/app/uploads/2024/05/3-1-1024x684.jpg" alt="" class="wp-image-2024" srcset="https://tantusdata.com/app/uploads/2024/05/3-1-1024x684.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/3-1-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/3-1-768x513.jpg 768w, https://tantusdata.com/app/uploads/2024/05/3-1-1536x1025.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/3-1-2048x1367.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p><strong>PyCon Estonia, September 5-6, 2024</strong></p>



<p>Join us for a workshop on LLMs at one of the largest Python gatherings in the Nordics. More details can be found <a href="https://pycon.ee/">here</a>.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://tantusdata.com/app/uploads/2024/05/SA11796-1-1024x683.jpg" alt="" class="wp-image-2029" srcset="https://tantusdata.com/app/uploads/2024/05/SA11796-1-1024x683.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/SA11796-1-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/SA11796-1-768x512.jpg 768w, https://tantusdata.com/app/uploads/2024/05/SA11796-1-1536x1024.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/SA11796-1-2048x1365.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p><strong>SREDAY, September 19-20, 2024</strong></p>



<p>We will be offering an LLM workshop at this in-person conference in London. Stay tuned for more updates <a href="https://sreday.com/">here</a>.</p>



<h2 class="wp-block-heading">Looking Ahead</h2>



<p>We have more exciting events coming soon and will be adding them to this article as they are confirmed. Stay tuned for updates and new opportunities to connect, learn, and innovate with us.</p>



<figure class="wp-block-gallery has-nested-images columns-default is-cropped wp-block-gallery-2 is-layout-flex wp-block-gallery-is-layout-flex">
<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="684" data-id="2033" src="https://tantusdata.com/app/uploads/2024/05/5-1024x684.jpg" alt="" class="wp-image-2033" srcset="https://tantusdata.com/app/uploads/2024/05/5-1024x684.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/5-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/5-768x513.jpg 768w, https://tantusdata.com/app/uploads/2024/05/5-1536x1025.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/5-2048x1367.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="684" data-id="2031" src="https://tantusdata.com/app/uploads/2024/05/9-1024x684.jpg" alt="" class="wp-image-2031" srcset="https://tantusdata.com/app/uploads/2024/05/9-1024x684.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/9-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/9-768x513.jpg 768w, https://tantusdata.com/app/uploads/2024/05/9-1536x1025.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/9-2048x1367.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="684" data-id="2035" src="https://tantusdata.com/app/uploads/2024/05/8-1024x684.jpg" alt="" class="wp-image-2035" srcset="https://tantusdata.com/app/uploads/2024/05/8-1024x684.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/8-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/8-768x513.jpg 768w, https://tantusdata.com/app/uploads/2024/05/8-1536x1025.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/8-2048x1367.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="684" data-id="2037" src="https://tantusdata.com/app/uploads/2024/05/7-1024x684.jpg" alt="" class="wp-image-2037" srcset="https://tantusdata.com/app/uploads/2024/05/7-1024x684.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/7-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/7-768x513.jpg 768w, https://tantusdata.com/app/uploads/2024/05/7-1536x1025.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/7-2048x1367.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
</figure>



<p><strong>Trending Topics: Apache Spark and LLMs</strong></p>



<p>Two key topics that garnered significant interest at our events were Apache Spark and LLMs. Due to this high demand, we have dedicated articles on these subjects, which we have added to the recommendations below.</p>



<p>Stay tuned for more updates and see you at our next event!</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://tantusdata.com/app/uploads/2024/05/20240123_SMG_Data_Summit_Google_Offices_Zurich_0439-1024x683.jpg" alt="" class="wp-image-2039" srcset="https://tantusdata.com/app/uploads/2024/05/20240123_SMG_Data_Summit_Google_Offices_Zurich_0439-1024x683.jpg 1024w, https://tantusdata.com/app/uploads/2024/05/20240123_SMG_Data_Summit_Google_Offices_Zurich_0439-300x200.jpg 300w, https://tantusdata.com/app/uploads/2024/05/20240123_SMG_Data_Summit_Google_Offices_Zurich_0439-768x512.jpg 768w, https://tantusdata.com/app/uploads/2024/05/20240123_SMG_Data_Summit_Google_Offices_Zurich_0439-1536x1024.jpg 1536w, https://tantusdata.com/app/uploads/2024/05/20240123_SMG_Data_Summit_Google_Offices_Zurich_0439.jpg 1920w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>TantusData Recognised as a Clutch Global Leader for Spring 2024</title>
		<link>https://tantusdata.com/insights/tantusdata-recognised-as-a-clutch-global-leader-for-spring-2024/</link>
		
		<dc:creator><![CDATA[Magdalena Majka]]></dc:creator>
		<pubDate>Mon, 27 May 2024 08:00:50 +0000</pubDate>
				<category><![CDATA[Clutch]]></category>
		<category><![CDATA[reviews]]></category>
		<guid isPermaLink="false">https://tantusdata.com/?post_type=insights&#038;p=2042</guid>

					<description><![CDATA[<p>TantusData named a top B2B company for Qlik, Hadoop, Tableau, Big Data Compliance, Fraud, &#038; Risk Management services.</p>
<p>The post <a href="https://tantusdata.com/insights/tantusdata-recognised-as-a-clutch-global-leader-for-spring-2024/">TantusData Recognised as a Clutch Global Leader for Spring 2024</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="1024" src="https://tantusdata.com/app/uploads/2024/05/Global-Award-Graphic-1024x1024.png" alt="" class="wp-image-2043" srcset="https://tantusdata.com/app/uploads/2024/05/Global-Award-Graphic-1024x1024.png 1024w, https://tantusdata.com/app/uploads/2024/05/Global-Award-Graphic-300x300.png 300w, https://tantusdata.com/app/uploads/2024/05/Global-Award-Graphic-150x150.png 150w, https://tantusdata.com/app/uploads/2024/05/Global-Award-Graphic-768x768.png 768w, https://tantusdata.com/app/uploads/2024/05/Global-Award-Graphic-1536x1536.png 1536w, https://tantusdata.com/app/uploads/2024/05/Global-Award-Graphic-2048x2048.png 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>Clutch has announced its recognition of TantusData as a 2024 Spring Global Award winner for Qlik, Hadoop, Tableau, Big Data Compliance, Fraud, &amp; Risk Management services on Clutch, the leading global marketplace of B2B service providers.&nbsp;</p>



<p>Honourees are selected based on their industry expertise and ability to deliver scores that are calculated based on client feedback from thousands of reviews published on Clutch.&nbsp;TantusData is honoured to be recognised as a 2024 Spring Clutch Global Award winner. This award is a testament to the excellent client work we have delivered this year, as recognised through the voice of our customers in their reviews on Clutch. We&#8217;re proud to be recognised as a leader on a global scale. The Clutch Global Awards showcase the very best in the B2B services industry worldwide.</p>



<blockquote class="wp-block-quote is-style-default is-layout-flow wp-block-quote-is-layout-flow">
<p>“We are incredibly proud to receive the 2024 Spring Clutch Global Award. This recognition is a direct reflection of our team’s hard work and dedication to delivering outstanding results for our clients. It’s an honour to see our efforts acknowledged on such a prestigious platform, and we are motivated to continue setting high standards in the industry.”</p>
<cite>Marcin Szymaniuk, CEO of TantusData</cite></blockquote>






<p>Since joining Clutch, TantusData has been delighted to receive multiple recognitions from the platform. Previously, we were listed as a leading service provider on Clutch, being named one of Poland&#8217;s industry game-changers in big data analytics. Read more about this recognition <a href="https://tantusdata.com/insights/tantusdata-a-leading-service-providers-clutch/">here</a>.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="576" height="1024" src="https://tantusdata.com/app/uploads/2024/05/Clutch_badges_TantusData-1-576x1024.jpg" alt="" class="wp-image-2091" srcset="https://tantusdata.com/app/uploads/2024/05/Clutch_badges_TantusData-1-576x1024.jpg 576w, https://tantusdata.com/app/uploads/2024/05/Clutch_badges_TantusData-1-169x300.jpg 169w, https://tantusdata.com/app/uploads/2024/05/Clutch_badges_TantusData-1-768x1365.jpg 768w, https://tantusdata.com/app/uploads/2024/05/Clutch_badges_TantusData-1-864x1536.jpg 864w, https://tantusdata.com/app/uploads/2024/05/Clutch_badges_TantusData-1-1152x2048.jpg 1152w, https://tantusdata.com/app/uploads/2024/05/Clutch_badges_TantusData-1-scaled.jpg 1440w" sizes="auto, (max-width: 576px) 100vw, 576px" /></figure>



<blockquote class="wp-block-quote has-text-align-left is-layout-flow wp-block-quote-is-layout-flow">
<p>“It is a joy to witness the incredible success of leading companies worldwide on our platform, and an even greater joy to recognise these companies as Clutch Global honourees. Their dedication to delivering next-level services to clients has not only bolstered their own success but empowered numerous clients to thrive as well. In recognising this spring’s Clutch Global honourees, we aim to showcase industry leaders and encourage connections for Clutch users seeking tailored services to achieve their goals.”&nbsp;</p>
<cite>Sonny Ganguly, Clutch CEO</cite></blockquote>






<p>View our recent work and reviews on <a href="https://clutch.co/profile/tantusdata#highlights">our Clutch profile</a>.</p>



<p>If you’re interested in hiring us for your next project, get in touch with us <a href="https://tantusdata.com/contact-us/">here</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Amplify Your Corporate Green Initiatives: Triumph in Sustainability and Reporting</title>
		<link>https://tantusdata.com/insights/amplify-corporate-green-initiatives-digital-sustainability/</link>
		
		<dc:creator><![CDATA[Magdalena Majka]]></dc:creator>
		<pubDate>Fri, 29 Mar 2024 14:01:45 +0000</pubDate>
				<category><![CDATA[Corporate Responsibility]]></category>
		<category><![CDATA[Digital Sustainability]]></category>
		<category><![CDATA[Green Technology]]></category>
		<guid isPermaLink="false">https://tantusdata.com/?post_type=insights&#038;p=1930</guid>

					<description><![CDATA[<p>Dive into how your company can elevate its environmental responsibility through digital sustainability. Discover key strategies for optimising digital efficiency, the impact of our digital world on the environment, and how adopting green digital practices can transform your business. Explore actionable insights and partner with TantusData to lead in sustainability and reporting.</p>
<p>The post <a href="https://tantusdata.com/insights/amplify-corporate-green-initiatives-digital-sustainability/">Amplify Your Corporate Green Initiatives: Triumph in Sustainability and Reporting</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="585" src="https://tantusdata.com/app/uploads/2025/01/Triple_reporting_TantusData_DigitalSustainability-1024x585.jpg" alt="" class="wp-image-2172" srcset="https://tantusdata.com/app/uploads/2025/01/Triple_reporting_TantusData_DigitalSustainability-1024x585.jpg 1024w, https://tantusdata.com/app/uploads/2025/01/Triple_reporting_TantusData_DigitalSustainability-300x171.jpg 300w, https://tantusdata.com/app/uploads/2025/01/Triple_reporting_TantusData_DigitalSustainability-768x439.jpg 768w, https://tantusdata.com/app/uploads/2025/01/Triple_reporting_TantusData_DigitalSustainability-1536x878.jpg 1536w, https://tantusdata.com/app/uploads/2025/01/Triple_reporting_TantusData_DigitalSustainability.jpg 1792w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<h2 class="wp-block-heading">Optimising Digital Efficiency: A Corporate Imperative</h2>



<p>In today’s digitally driven corporate landscape, the environmental footprint of our online activities, which encompass cloud services, computing, and the broader internet infrastructure, often escapes scrutiny. These critical components of modern business operations are not merely sources of operational inefficiency and financial expenditure; they significantly contribute to environmental degradation. Therein lies a pivotal call to action for businesses globally: to reassess and refine their digital infrastructure, from the intricacies of <a href="https://www.sciencedirect.com/science/article/abs/pii/S0308596123002124">cloud storage systems</a> to the depths of computing processes. By elevating digital cleanliness to a strategic priority, organisations stand to not only meet emerging standards of <a href="https://www.itu.int/hub/2023/11/cutting-industry-emissions-and-fighting-the-climate-crisis/">digital sustainability reporting</a>, an accolade in corporate environmental stewardship, but also uncover substantial avenues for cost savings. Diving deeper into the ethos of digital sustainability reveals its capacity to forge a future that is not only environmentally harmonious but also streamlined for greater efficiency and financial prudence.</p>



<p>It&#8217;s crucial to acknowledge that certain areas prone to digital wastefulness, including redundant data storage, inefficient computing practices, and overlooked aspects of cloud utilisation, may remain hidden beneath the surface of day-to-day operations. Others are even less obvious, such as the use of less energy-efficient programming languages. These undercurrents of inefficiency represent missed opportunities for optimisation and sustainability. Importantly, while digital sustainability reporting is not currently mandated in triple reporting, it serves as a powerful testament to an organisation’s commitment to environmental stewardship and technological foresight. Embracing this proactive approach positions a company not only as a leader in ecological responsibility but also as a visionary in harnessing the potential for long-term cost-effectiveness through digital sustainability. Let&#8217;s explore how embedding these principles within your corporate ethos can catalyse a transition towards a more sustainable, efficient, and economically viable future.</p>



<h2 class="wp-block-heading">The Environmental Impact of Our Digital World</h2>



<p>In today&#8217;s interconnected landscape, every digital interaction, from a cloud storage retrieval to sophisticated computations and online streaming, consumes energy and adds to CO2 emissions. Merely scaling back online activities is not a sufficient response; a comprehensive and strategic approach to <a href="https://www.statista.com/statistics/1112982/ai-adoption-worldwide-industry-function/">#DigitalSustainability</a> is essential. The surge in integrating AI and ML technologies, such as ChatGPT, into corporate strategies has underscored this urgency. Recent statistics reveal that as of 2023, approximately <a href="https://www.statista.com/statistics/1384323/industries-using-chatgpt-in-business/">49% of companies have already incorporated ChatGPT</a> into their operations, with an additional 30% planning to embrace it. This trend reflects a broader embrace of artificial intelligence across industries, where a significant number of companies have adopted or plan to adopt AI technologies. Moreover, the AI market is poised for substantial growth, with projections indicating an <a href="https://www.statista.com/outlook/tmo/artificial-intelligence/worldwide">annual growth rate (CAGR 2024-2030) of 15.83%</a>. These figures not only highlight the rapid integration of AI into business processes but also emphasise the growing need for solutions that prioritise energy efficiency and sustainability in the digital domain. As AI and digital technologies become increasingly central to corporate operations, the push for environmentally conscious and sustainable digital infrastructures becomes critically important, urging companies to adopt practices that mitigate their digital environmental footprint.</p>



<h2 class="wp-block-heading">Clarifying the Internet&#8217;s Broad Reach</h2>



<p>The term &#8220;internet&#8221; extends far beyond browsing and email; it encompasses cloud services and computing, pivotal components of our digital ecosystem&#8217;s environmental impact. In 2020, this vast infrastructure accounted for <a href="https://www.sciencedirect.com/science/article/abs/pii/S0308596123002124">1.4% of global emissions</a> and <a href="https://dl.acm.org/doi/10.1145/3613207">consumes 4% of the world’s electricity</a>, standing shoulder to shoulder with the aviation industry in terms of environmental impact. As we delve deeper into the digital age, the situation grows more pressing, with digitalisation&#8217;s energy consumption marked at <a href="https://dl.acm.org/doi/10.1145/3613207">10% of worldwide energy consumption in 2023</a>. This figure is on an upward trajectory, especially with the proliferation of large language models like ChatGPT, highlighting a concerning trend towards increasing energy demand. Further insights from <a href="https://www.enerdata.net/publications/executive-briefing/world-energy-consumption-from-digitalization.pdf">a comprehensive analysis</a> echo these sentiments, projecting a continuous rise in energy consumption fuelled by digitalisation.</p>



<p>In light of these developments, aligning with the Paris Agreement&#8217;s ambitious goals becomes even more critical. Specifically, the global ICT industry faces the daunting task of reducing its greenhouse gas (GHG) emissions by <a href="https://www.itu.int/hub/2023/11/cutting-industry-emissions-and-fighting-the-climate-crisis/">45% from 2020 levels by 2030</a>, a target that underscores the urgent need for sustainable practices across all sectors of digital infrastructure. This challenge, while formidable, presents an opportunity for organisations to lead by example, not just in adhering to but also in surpassing triple reporting standards, thereby demonstrating an unwavering commitment to environmental stewardship and sustainable development in the digital age.</p>



<h2 class="wp-block-heading">Why Digital Sustainability Matters for Your Business</h2>



<p>In the past decade, sustainability has transformed from a niche interest into a core component of corporate strategy, reflecting not only business priorities but also growing consumer concerns. However, the swift pace at which technology evolves, alongside slow-moving regulatory frameworks, has left many businesses scrambling to keep up. The expanding environmental footprint of big data, cloud computing, and AI technologies has become impossible to ignore, compelling companies to proactively adopt sustainable digital practices.</p>



<p>Adding to the urgency is the heightened public and corporate awareness brought about by innovations like ChatGPT and, most recently, Gemini, which have thrust the topic of big data into mainstream discussions. This visibility is likely to attract increased scrutiny from regulatory bodies and could result in sustainability metrics being incorporated into triple reporting standards in the near future. By taking the initiative to audit and refine your data management practices now, your company can not only stay ahead of potential regulatory changes but also demonstrate leadership in digital sustainability.</p>



<p>With Earth Day approaching, there&#8217;s a timely and compelling opportunity to position your company at the forefront of sustainable digital operations. This moment serves as a reminder of the importance of integrating environmental considerations into every aspect of your business, especially in areas that are often out of the public eye, such as digital infrastructure. By committing to sustainable digital practices today, your company can lead by example, showing that a commitment to the planet is not just good ethics—it&#8217;s also good business.</p>



<h2 class="wp-block-heading">Identifying Opportunities for Digital Streamlining</h2>



<p>For companies dedicated to minimising their digital environmental impact, a thorough examination of digital operations is paramount. Identifying inefficiencies and areas for optimisation can significantly reduce digital waste and energy consumption. Here’s a deeper look into key areas for digital streamlining:</p>



<ul class="wp-block-list">
<li><strong>Evaluating Data Pipelines:</strong> Assess whether all data pipelines in use are essential or if some have become redundant. Redundant pipelines can lead to unnecessary energy consumption and complicate data management. For a practical example, explore our <a href="https://tantusdata.com/success-stories/failing-silently-case-study/">case study on a silently failing pipeline</a>, demonstrating the importance of vigilance in pipeline efficiency.</li>



<li><strong>Ensuring Infrastructure Efficiency:</strong> Verify that all cloud and computing infrastructures are functioning optimally. Inefficient infrastructures can increase energy usage and operational costs. Our <a href="https://tantusdata.com/success-stories/introducing-apache-airflow-telecom-data-migration/">case study on improving the orchestration of pipelines</a> highlights the benefits of streamlining infrastructure for enhanced efficiency.</li>



<li><strong>Addressing Computing Errors:</strong> Issues and errors in computing not only delay operations but also waste computational resources, leading to increased energy usage. Our <a href="https://tantusdata.com/success-stories/sharing-the-magicians-toolkit/">case of malfunctions</a> case study showcases strategies for addressing computing errors effectively.</li>



<li><strong>Optimising Job Execution:</strong> Examine if there are issues with running jobs inefficiently. Streamlining job execution can reduce processing time and energy expenditure. Our case study on <a href="https://tantusdata.com/success-stories/fixing-the-other-side-of-custom-made/">a non-efficient bespoke solution</a> illustrates the importance of optimising job execution.</li>



<li><strong>Purifying Data Storage:</strong> Identify and eliminate repeated or redundant data, which most often results from low data quality. Efficient data storage reduces the need for extensive computing power to manage and access data. You can find more on ensuring data quality in <a href="https://www.youtube.com/watch?v=wj73Zgqlyic">our video guide.</a></li>



<li><strong>Decluttering Cloud Space:</strong> Remove unnecessary items from cloud storage. Even the most organised companies often keep old documents long after they have become obsolete. Unlike physical copies cluttering the office, cloud versions are easy to forget. Make sure that you are not paying for the storage of unnecessary items. This practice frees up space and reduces the energy required for data retrieval and maintenance.</li>



<li><strong>Avoiding Data Duplication:</strong> Ensure data is not duplicated across departments. Organisations are often divided by function, and in those cases various departments collect the same data instead of using one database, so storage is multiplied with each team that keeps its own copy. Centralised data management enhances efficiency and reduces storage requirements.</li>



<li><strong>Streamlining User Journeys:</strong> Simplify user interfaces and processes to reduce the computational power required for navigating through systems.</li>



<li><strong>Enhancing LLM Efficiency:</strong> To maximise digital sustainability and conserve computing resources while using energy-intensive LLMs like ChatGPT, consider a multifaceted approach that streamlines information retrieval and optimises query responses. By refining your system to deliver more relevant answers, you can significantly reduce the volume of queries. Implementing <a href="https://www.researchgate.net/publication/367543986_Improving_NoSQL_Spatial-Query_Processing_with_Server-Side_In-Memory_R-Tree_Indexes_for_Spatial_Vector_Data">advanced indexing techniques</a> and preprocessing data makes searches more energy-efficient and produces contextually richer responses. This strategy not only accelerates access to information through efficient retrieval processes but also minimises unnecessary data processing. Furthermore, using <a href="https://chat.openai.com/c/bc636887-b60f-4c70-b894-b78cdb809c21">covered queries</a>, which are answered from indexed documents, lets the system pre-answer common inquiries, enhancing performance and saving additional energy and resources. This holistic approach enhances the LLM&#8217;s operational efficiency and contributes to a more sustainable digital environment.</li>



<li><strong>Choosing Efficient Programming Languages, Frameworks and Databases:</strong> Some programming languages are more energy-efficient than others. Selecting an efficient language can reduce the computational power needed for the same tasks.</li>



<li><strong>Removing Unused Code:</strong> Libraries often contain code that is no longer in use. Identifying and removing such code can decrease the amount of processing required during operations.</li>
</ul>
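
<p>As one illustration of the storage-related points above, duplicated files can be located by comparing content hashes. The sketch below is a minimal example, not a description of TantusData tooling: it assumes the storage in question is reachable as a local directory, and the function names (<code>file_digest</code>, <code>find_duplicates</code>, <code>reclaimable_bytes</code>) are hypothetical.</p>

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root):
    """Group files under root by content hash; keep groups of two or more."""
    by_digest = defaultdict(list)
    for folder, _subdirs, names in os.walk(root):
        for name in names:
            path = os.path.join(folder, name)
            by_digest[file_digest(path)].append(path)
    return [sorted(paths) for paths in by_digest.values() if len(paths) > 1]


def reclaimable_bytes(groups):
    """Bytes that would be freed if one copy per duplicate group were kept."""
    return sum(os.path.getsize(paths[0]) * (len(paths) - 1) for paths in groups)
```

<p>Calling <code>find_duplicates</code> on a shared folder returns lists of paths with identical content, and <code>reclaimable_bytes</code> gives a rough estimate of the storage that consolidation could free. Whether a duplicate is safe to delete remains a decision for the data owner.</p>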



<p>By focusing on these areas, companies can take significant steps towards reducing their digital footprint. Each of these strategies not only contributes to a more sustainable digital environment but also enhances operational efficiency and cost-effectiveness. These are some key considerations, and many more aspects can be identified and analysed during a comprehensive digital audit to further streamline operations and minimise environmental impact.</p>



<h2 class="wp-block-heading">Partnering with TantusData for Enhanced Digital Sustainability</h2>



<p>TantusData is at the forefront of assisting companies in navigating the complexities of digital sustainability. Our focus on optimising data management, computing efficiency, and overall digital infrastructure supports not only your environmental objectives but also contributes to operational cost savings. By collaborating with TantusData, your organisation can achieve sustainability milestones and receive certification for your green digital practices, enhancing your corporate value in the process.</p>



<h2 class="wp-block-heading">Your Business’s Path Forward: A Sustainable Digital Future</h2>



<p>Adopting sustainable digital practices is more than an environmental responsibility—it’s a strategic advantage in today’s corporate world. As we continue to navigate the digital revolution, it’s imperative for companies to integrate sustainability into their digital operations. This is an invitation for your business to explore how TantusData can transform your digital footprint into an asset for both the planet and your bottom line. Reach out to us to discover how we can help you achieve a leading edge in digital sustainability.</p>



<p>Let’s <a href="https://tantusdata.com/contact-us/">get in touch.</a></p>
<p>The post <a href="https://tantusdata.com/insights/amplify-corporate-green-initiatives-digital-sustainability/">Amplify Your Corporate Green Initiatives: Triumph in Sustainability and Reporting</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>April Fools&#8217; Reveal: A Journey to Digital Sustainability!</title>
		<link>https://tantusdata.com/insights/april-fools-to-digital-sustainability/</link>
		
		<dc:creator><![CDATA[Magdalena Majka]]></dc:creator>
		<pubDate>Fri, 01 Dec 2023 14:39:19 +0000</pubDate>
				<category><![CDATA[Digital Sustainability]]></category>
		<category><![CDATA[EcoFriendlyTech]]></category>
		<guid isPermaLink="false">https://tantusdata.com/?post_type=insights&#038;p=1952</guid>

					<description><![CDATA[<p>Gotcha! Our announcement of Underwater Data Centers was a playful April Fools&#8217; Day jest. While we won&#8217;t be storing data in the deep blue sea, we&#8217;re serious about the importance of digital sustainability. As we all navigate the digital landscape, the environmental impact of our online activities requires thoughtful consideration and action. We encourage you [&#8230;]</p>
<p>The post <a href="https://tantusdata.com/insights/april-fools-to-digital-sustainability/">April Fools&#8217; Reveal: A Journey to Digital Sustainability!</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="576" src="https://tantusdata.com/app/uploads/2024/03/white-april-fools-day-instagram-post-1024x576.jpg" alt="" class="wp-image-1953" srcset="https://tantusdata.com/app/uploads/2024/03/white-april-fools-day-instagram-post-1024x576.jpg 1024w, https://tantusdata.com/app/uploads/2024/03/white-april-fools-day-instagram-post-300x169.jpg 300w, https://tantusdata.com/app/uploads/2024/03/white-april-fools-day-instagram-post-768x432.jpg 768w, https://tantusdata.com/app/uploads/2024/03/white-april-fools-day-instagram-post-1536x864.jpg 1536w, https://tantusdata.com/app/uploads/2024/03/white-april-fools-day-instagram-post.jpg 1920w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>Gotcha! Our announcement of Underwater Data Centers was a playful April Fools&#8217; Day jest. While we won&#8217;t be storing data in the deep blue sea, we&#8217;re serious about the importance of digital sustainability. As we all navigate the digital landscape, the environmental impact of our online activities requires thoughtful consideration and action.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="576" src="https://tantusdata.com/app/uploads/2024/03/Simple-Corporate-Sustainability-Linkedin-Video-Ad-Presentation-1024x576.jpg" alt="" class="wp-image-1955" srcset="https://tantusdata.com/app/uploads/2024/03/Simple-Corporate-Sustainability-Linkedin-Video-Ad-Presentation-1024x576.jpg 1024w, https://tantusdata.com/app/uploads/2024/03/Simple-Corporate-Sustainability-Linkedin-Video-Ad-Presentation-300x169.jpg 300w, https://tantusdata.com/app/uploads/2024/03/Simple-Corporate-Sustainability-Linkedin-Video-Ad-Presentation-768x432.jpg 768w, https://tantusdata.com/app/uploads/2024/03/Simple-Corporate-Sustainability-Linkedin-Video-Ad-Presentation-1536x864.jpg 1536w, https://tantusdata.com/app/uploads/2024/03/Simple-Corporate-Sustainability-Linkedin-Video-Ad-Presentation.jpg 1920w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>We encourage you to dive into our genuine insights on digital sustainability and learn how we can collectively reduce our environmental footprint in the digital age. It&#8217;s time to make every click, upload, and stream count towards a more sustainable future.</p>



<p>Explore our strategies and join the conversation at <a href="https://tantusdata.com/insights/amplify-corporate-green-initiatives-digital-sustainability/">Amplify Corporate Green Initiatives with Digital Sustainability</a>.</p>
<p>The post <a href="https://tantusdata.com/insights/april-fools-to-digital-sustainability/">April Fools&#8217; Reveal: A Journey to Digital Sustainability!</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>The TantusData Summer Challenge: terms and conditions</title>
		<link>https://tantusdata.com/insights/the-tantusdata-summer-challenge-terms-and-conditions/</link>
		
		<dc:creator><![CDATA[TantusData]]></dc:creator>
		<pubDate>Fri, 14 Jul 2023 11:32:05 +0000</pubDate>
				<guid isPermaLink="false">https://tantusdata.com/?post_type=insights&#038;p=1598</guid>

					<description><![CDATA[<p>Rules of the “Summer Challenge” contest on LinkedIn § 1 [Definitions] The phrases and terms used in the content of these Rules shall have the meanings indicated below:1. „Organizer&#8221; &#8211; TantusData Sp. z o.o. based in POLAND, ul.&#160; Alexa Niepodległości 132/136 unit 3, 02-544 Warszawa, under KRS number: 0000930059, NIP: 5213944638, REGON: 520344085, being the [&#8230;]</p>
<p>The post <a href="https://tantusdata.com/insights/the-tantusdata-summer-challenge-terms-and-conditions/">The TantusData Summer Challenge: terms and conditions</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<h2 class="wp-block-heading">Rules of the “Summer Challenge” contest on LinkedIn</h2>






<p><strong>§ 1 [Definitions]</strong></p>



<p><br>The phrases and terms used in these Rules shall have the meanings indicated below:<br>1. &#8220;Organizer&#8221; &#8211; TantusData Sp. z o.o. based in Poland, ul. Alexa Niepodległości 132/136 unit 3, 02-544 Warszawa, registered under KRS number: 0000930059, NIP: 5213944638, REGON: 520344085, being the owner of the TantusData brand;<br>2. &#8220;Rules&#8221; &#8211; these Rules of the Contest entitled “TantusData Summer Challenge”, binding for the Organizer and Contestants, regulating the terms and conditions of the Contest, in particular the terms of participation in the Contest and the rights and obligations of the Organizer, Contest Committee and Contestants in relation to their participation in the Contest;<br>3. &#8220;Contest&#8221; &#8211; the contest entitled “TantusData Summer Challenge&#8221;, organised by the Organizer and conducted under these Rules;<br>4. &#8220;LinkedIn&#8221; &#8211; the website under the domain www.linkedin.com, where the Contest is partially held, owned by the Microsoft Corporation;<br>5. &#8220;Fanpage&#8221; &#8211; the profile named “@TantusData&#8221; on the LinkedIn website available at: <em>https://www.linkedin</em> owned and administered by the Organizer;<br>6. &#8220;Contestant&#8221; &#8211; a LinkedIn user who has made a correct and effective entry to the Contest, meeting the eligibility requirements for participation in the Contest as described in § 3 of the Rules;<br>8. &#8220;Contest Committee&#8221; &#8211; a team consisting of persons selected by the Organizer, which evaluates the entries;</p>



<p>9. &#8220;Winner&#8221; &#8211; a Contestant who has submitted a correct and effective entry to the Contest and whose entry was awarded by the Contest Committee in accordance with the provisions of §4 and §6 of these Rules.</p>



<p><br><strong>§ 2 [General provisions]</strong></p>



<p><br>1. The Organizer is the sponsor of the prizes.<br>2. The Organizer is the administrator of personal data provided by Contestants.<br>3. Providing personal data is optional, but necessary for the Contestant to enter the Contest. Persons who make their data available have the right to access, modify or delete them.<br>4. These Rules define the conditions of the Contest.<br>5. The Contest is not created, administered, endorsed or sponsored by the social networking site LinkedIn, by the Microsoft Corporation, by Kaggle or by Google LLC. &#8220;LinkedIn&#8221; is a registered trademark of the <a href="https://trademarks.justia.com/owners/linkedin-corporation-1472257/">LinkedIn Corporation</a>.<br>6. The Contest is carried out on the Contestant’s profile and the Organizer’s Fanpage.<br>7. The Organizer&#8217;s Contest Committee supervises the correctness and course of the Contest, i.e. providing information on the Contest and dealing with complaints.<br>8. The Contest runs from 21 July 2023 to 16 September 2023.</p>






<p><strong>§ 3 [Contestants]</strong></p>



<p><br>1. Participants in the Contest may only be natural persons who are consumers, have full legal capacity, are LinkedIn users, have an active account on LinkedIn, abide by the rules of the LinkedIn service and have accepted these Rules.<br>2. The Contestant declares that he/she/they:<br>a. is a natural person with full legal capacity;<br>b. is domiciled in the territory of <em>the European Union or the United Kingdom of Great Britain and Northern Ireland;</em><br>c. is familiar with the content of the present Rules and voluntarily enters the Contest;</p>



<p>d. accepts the terms of these Rules in full, including the procedure for claiming prizes;</p>



<p>e. undertakes to comply with the provisions of these Rules and the rules of the LinkedIn service;</p>



<p>f. has agreed to the processing of his/her/their personal data for the purposes of participating in the Contest;</p>



<p>g. is a registered user of the LinkedIn social network.</p>



<p>3. The employees and collaborators of the Organizer cannot participate in the Contest.</p>



<p><br><strong>§ 4 [Prize]</strong></p>






<p>1. The Prizes will be awarded to the first three (3) persons who, in the opinion of the Contest Committee, best complete the Contest Task and are selected by the Contest Committee as Winners.</p>



<p>2. The Organizer will award 3 prizes to the Participants who will be declared Winners (&#8220;Prizes&#8221;), one award per winner. The Prizes in the Contest are respectively:</p>



<p>a) For first place – an office survival kit containing: a 50 EUR Amazon gift card, a notebook, 2 Post-its, 5 pens, a mug and a tea, a bag, a t-shirt, cookies, an adult colouring book, crayons;</p>



<p>b) For second place – an office survival kit containing: a 50 EUR Amazon gift card, a notebook, 2 Post-its, 5 pens, a mug and a tea, a bag, a t-shirt, cookies, an adult colouring book, crayons;</p>



<p>c) For third place – an office survival kit containing: a 50 EUR Amazon gift card, a notebook, 2 Post-its, 5 pens, a mug and a tea, a bag, a t-shirt, cookies, an adult colouring book, crayons.</p>



<p>3. Prizes will be issued only in the form specified herein, without the possibility of exchanging them for another material prize or for a cash equivalent.</p>



<p>4. The prizes will be issued to the Winners in a manner communicated to them in further communication.</p>



<p>5. The Winners will be informed about the prize and the conditions of Prize collection in an announcement on the Fanpage and through a private message sent to the Winners on LinkedIn within 10 working days after the end of the Contest.</p>



<p>6. By entering the Contest, the Contestant agrees that the Organizer may communicate with him/her in connection with the Contest via the LinkedIn account.</p>



<p>7. In order for the Winner to receive the Prize, the necessary data must be sent to the Organizer within 48 hours of the prize communication being sent to the Winner (to the email provided by the Organizer in the message). The data necessary for the Prize to be released are:</p>



<p>a) name and surname;</p>



<p>b) residential address (for tax purposes);</p>



<p>c) correspondence address (for the purposes of delivering the Prize, it may be different from the residential address);</p>



<p>d) telephone number;</p>



<p>e) e-mail address.</p>



<p>9. The Prizes will be dispatched without undue delay after the end of the Contest. The Prizes shall be sent by post to the address indicated by the Winner. The Organizer shall not be liable for the actions or omissions of the delivery service providers.</p>



<p>10. The Winner may waive the Prize but shall not be entitled to the cash equivalent or any other prize. If the Prize is waived or forfeited, the Organizer reserves the right to award the Prize to another Contestant or to decide not to award a Prize. The Winner may not transfer the right to the Prize to third parties.</p>



<p>11. If the Winner fails to provide the Organizer with the data referred to in point 7 above, exceeds the permitted response time, or sends incorrect data, the Winner loses the right to the Prize.</p>



<p>12. The Organizer reserves the right to verify whether the Winners fulfil the conditions stipulated in the Rules as well as in legal regulations. For this purpose, the Organizer may require the Winner to make specific statements, provide specific data or submit specific documents. Failure to comply with the Rules or relevant provisions of law, as well as refusal to comply with the above demands, may result in the exclusion of a given person from the Contest, and shall entitle the Organizer to refuse to award the Prize without any claims against the Organizer.</p>



<p>13. All Contestants who complete the Contest Task appropriately, as determined by the Contest Committee, will be awarded online certificates of successful completion. The certificates will be issued within 14 days of the Contest closing date and will be sent to the Contestants via LinkedIn&#8217;s messaging service. The certificates confirm only the successful completion of the Contest Task; the Organizer verifies only the correctness of the submitted solution, not the process by which the Contestant arrived at it.</p>



<p><br><strong>§ 5 [Rules of the Contest]</strong></p>






<p>1. The Contestant&#8217;s task is to create a model that detects whether two texts were written by the same author, using the datasets provided by TantusData within the challenge on the Kaggle platform. The data provided was collected from articles on our blog and two publicly available datasets of news and blog posts. Contestants can also use external datasets. Contestants are asked to create a smart solution which can take advantage of even a small dataset and must present it in the format specified on Kaggle under the evaluation tab. Contestants are also asked to write a description in a Kaggle notebook and make it public after the submission deadline. Entries must be submitted in Kaggle under the submissions tab for the competition, as specified in the overview. This is the complete Summer Challenge task (hereinafter: &#8220;Contest Task&#8221;). Full instructions and guidelines are available on Kaggle under the challenge section.</p>



<p>2. One Contestant may make multiple submissions, but will be asked to select their best attempt at the Contest Task. The Contestant must work alone.</p>



<p>3. The Contestant is forbidden to take any action in connection with participation in the Contest that is contrary to the law and good manners, or to use data obtained in connection with participation in the Contest for illegal purposes. In particular, in the content of an answer to a Contest Task, a Contestant cannot include:</p>



<p>a) materials and symbols against the law;</p>



<p>b) materials and symbols violating the rights of third parties or the Organizer, especially those violating intellectual property rights or the personal rights of third parties or the Organizer;</p>



<p>c) expressions commonly regarded as morally reprehensible or socially inappropriate as well as content violating good morals;</p>



<p>d) obscene or pornographic materials and content;</p>



<p>e) materials and symbols propagating violence or discrimination, inciting racial, religious or ethnic hatred, socially recognised as offensive, vulgar, etc.;</p>



<p>f) content violating the rules of so-called &#8220;netiquette&#8221;;</p>



<p>g) personal data of other people in an unlawful scope, in particular it is forbidden to send an answer to a contest task using someone else&#8217;s name and surname in order to impersonate a particular person;</p>



<p>h) materials, the use of which by the Organiser may hinder or prevent the operation of other programs used by the Organiser, especially materials containing computer viruses.</p>



<p>4. Answers to the Contest Task that do not meet the requirements specified in the Rules will be excluded from the Contest.</p>



<p>5. In order to ensure the proper organisation and conduct of the Contest, and in particular to assess the accuracy of entries and select the Winners, the Organiser appoints a Contest Committee, which will supervise and arbitrate the Contest.</p>



<p>6. The Contest Committee will award the accurate solutions that most swiftly and appropriately realise the Contest Task. Submissions are evaluated based on the accuracy score&nbsp;between the predicted label and the true target, and on a description of a scaled and optimal solution that best fulfils the task with sensible use of resources.</p>



<p>7. From among the submitted Contest Tasks, the Contest Committee will select the Winners, indicating the decisive features of the selection and deciding also on the allocation of places and the distribution of Prizes.</p>



<p>8. The Organizer will also inform about the Winners in a public announcement on the Fanpage.</p>



<p>9. Detailed information about the contest will be available on the Organizer&#8217;s website.</p>



<p><br><strong>§ 6 [Conditions of participation in the Contest]</strong></p>



<p></p>



<p>1. Access to the Contest is free of charge.</p>



<p>2. Necessary conditions for participation in the Contest are:</p>



<p>a. acceptance of the Rules and the correct completion of all tasks described in § 5 Paragraph 1 of the Rules;</p>



<p>b. granting by the Contestant consent to the processing of personal data described in these Regulations by the Organizer (in particular consent described in § 8 of the Rules);</p>



<p>c. transfer of copyrights as referred to in § 9 of the Rules.</p>



<p><br><strong>§ 7 [Organiser&#8217;s responsibilities]</strong></p>



<p></p>



<p>1. The Organiser shall not be responsible for the accuracy and truthfulness of the data of the Contestants, or for the inability to deliver the Prizes for reasons attributable to the Participant, in particular if the Participant did not provide a real mailing address, or if the data provided is incomplete or outdated.</p>



<p>2. The Organiser declares that it does not control or monitor the content posted by Participants in terms of reliability and truthfulness, subject to actions related to the removal of violations of the Regulations or generally applicable laws.</p>



<p>3. The Organizer reserves the right to exclude from the Contest those Contestants whose actions contravene the law, the Rules or LinkedIn rules, in particular Contestants who:</p>



<p>a) post content that contravenes the applicable law, Rules, LinkedIn rules (in particular, containing offensive content, both in the text and graphic layer);</p>



<p>b) Complete the Contest Task with help (not on their own);</p>



<p>c) Interfere with the Contest mechanism;</p>



<p>4. The Organizer is not responsible for any malfunctions in data communications links, servers, interfaces, browsers, the LinkedIn platform or the Kaggle platform.</p>



<p>5. The Organizer is not responsible for temporary or permanent blocking of the LinkedIn profiles and pages or any of its mobile applications.</p>



<p></p>



<p><strong>§ 8 [Processing of personal data]</strong></p>



<p></p>



<p>1. Personal data of Contestants, including image, shall be processed by Organizer only for the purpose of performing activities necessary for proper conduct of the Contest.</p>



<p>2. Personal data of the Competition Participants will be stored by the Organiser only for the time necessary to conduct the Competition and award the prizes to the awarded Participants.</p>



<p>3. Personal data will be processed for the following purposes: publication of competition posts and their promotion in:</p>



<p>a. any social media of TantusData brand (LinkedIn; YouTube);</p>



<p>b. on the Organizer&#8217;s website in the domain www.tantusdata.com ;</p>



<p>4. Participants have the right to access, correct and delete processed data. Data is provided on a voluntary basis, with registration on the <strong>??</strong> submission form (e.g. Google form/name of the server) network required for participation in the Competition. The Organiser is not responsible for the way LinkedIn (the service used) processes personal data.</p>



<p>5. The Contestant has at any time the right to request the deletion of his personal data and the data of minors from his post. Upon deletion of the data, the Contestant loses the possibility to participate in the Contest.</p>



<p>6. The use of personal data takes place on global communication channels, without territorial restrictions and without time limits (until the removal of data by the Contestants or the cessation of the Organizer&#8217;s activities).</p>



<p><br><strong>§ 9 [Copyright]</strong></p>



<p></p>



<p>1. It is forbidden to infringe in any way the intellectual property rights in the Contest, especially the unauthorized use by the Contestants of works authored by third parties.</p>



<p>2. Contestants in respect of whom the Organiser has received information that they are not the authors of Contest Tasks, or do not have the rights to the answers to the Contest Task, are subject to exclusion from the Contest.<br>3. In the event that an answer to a Contest Task is a work as defined by the Polish Act on Copyright and Similar Rights, the entirety of the Participant&#8217;s economic copyright to the answer to the Contest Task and the right to permit the exercise of subsidiary rights, without any time and territorial restrictions, passes to the Organiser. The Organiser shall be entitled to use and dispose of the work – the contest posts &#8211; in the following forms of exploitation:<br>a) within the scope of recording and multiplying the work in whole or in part &#8211; the production of copies of the work using a specific technique, including digital reproduction, printing, reprography, magnetic recording and digital technique;<br>b) within the scope of dissemination of the work, in its entirety or in part, in a manner other than specified in item a) above &#8211; placing on social media, websites, publishing in the Organiser&#8217;s promotional and advertising materials, exhibiting, displaying, reproducing, as well as broadcasting and rebroadcasting, as well as making the work available to the public in such a way that everyone can have access to it in a place and time selected by themselves, including via the Internet.</p>



<p></p>



<p><strong>§ 10 [Complaints and notifications of violations]</strong></p>



<p><br>1. Any complaints regarding the way the Contest is carried out should be submitted by the Contestant in writing during the Contest, but not later than within 14 (fourteen) days from the date of issuing the Prizes.<br>2. A complaint submitted after the deadline shall have no legal effect.<br>3. A written complaint should contain the name, surname, exact address of the Contestant and a detailed description and justification of the complaint.<br>4. The complaint should be sent by registered mail or courier service to the address of the Organiser.<br>5. Claims will be considered in writing within 30 days from their submission.</p>



<p><br><strong>§ 11 [Final provisions]</strong></p>



<p></p>



<p>1. The Regulations shall enter into force on the first day of the competition, i.e. July 21st 2023.<br>2. In matters not covered by these Regulations, the provisions of the Polish Civil Code and other applicable laws shall apply.<br>3. Disputes relating to and arising from the Contest will be resolved by a court of law with jurisdiction over the Organiser&#8217;s registered office.<br>4. The Organizer reserves the right to change the rules of the Contest during its duration for important reasons, with the reservation that all possible changes in the Rules will not affect the rights of Contest participants acquired on the basis of the Rules before the date of entry into force of these changes. Information about changes will be posted on the Fanpages.<br>5. A brief description of the Contest rules can be found in advertising and information materials accompanying the Contest, in particular on the Fanpages. All content included in these materials is for information purposes only. Only the provisions of these Regulations are binding.<br>6. Contest Regulations are available on Fanpages.</p>
<p>The post <a href="https://tantusdata.com/insights/the-tantusdata-summer-challenge-terms-and-conditions/">The TantusData Summer Challenge: terms and conditions</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>The Summer Challenge</title>
		<link>https://tantusdata.com/insights/the-tantusdata-summer-challenge/</link>
		
		<dc:creator><![CDATA[TantusData]]></dc:creator>
		<pubDate>Thu, 06 Jul 2023 13:58:50 +0000</pubDate>
				<guid isPermaLink="false">https://tantusdata.com/?post_type=insights&#038;p=1597</guid>

					<description><![CDATA[<p>Introducing the TantusData Summer Challenge This summer, we&#8217;re not just beating the heat, we&#8217;re challenging it. At TantusData, we believe in nurturing the approachable spirit of knowledge-sharing, and to celebrate this, we&#8217;re excited to present the TantusData Summer Challenge. Our challenge is an ode to experts in the making and a refresher for the seasoned [&#8230;]</p>
<p>The post <a href="https://tantusdata.com/insights/the-tantusdata-summer-challenge/">The Summer Challenge</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="576" src="https://tantusdata.com/app/uploads/2023/07/SummerChallenge-1-1024x576.jpg" alt="" class="wp-image-1685" srcset="https://tantusdata.com/app/uploads/2023/07/SummerChallenge-1-1024x576.jpg 1024w, https://tantusdata.com/app/uploads/2023/07/SummerChallenge-1-300x169.jpg 300w, https://tantusdata.com/app/uploads/2023/07/SummerChallenge-1-768x432.jpg 768w, https://tantusdata.com/app/uploads/2023/07/SummerChallenge-1-1536x864.jpg 1536w, https://tantusdata.com/app/uploads/2023/07/SummerChallenge-1-2048x1152.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<h2 class="wp-block-heading">Introducing the TantusData Summer Challenge</h2>



<p>This summer, we&#8217;re not just beating the heat, we&#8217;re challenging it. At TantusData, we believe in nurturing the approachable spirit of knowledge-sharing, and to celebrate this, we&#8217;re excited to present the TantusData Summer Challenge.</p>



<p>Our challenge is an ode to experts in the making and a refresher for the seasoned professionals. It’s a platform to test your skills and learn, as we will post the optimal solution with instructions later. As a cherry on top, there&#8217;s a reward &#8211; an &#8216;Office Survival Kit&#8217;, filled with a delightful mix of practical and amusing items. This includes a handy notebook, colourful post-its, essential pens, a sarcastic mug paired with a soothing tea, a stylish t-shirt, a tote, appetising cookies for that quick brain boost, and for moments of relaxation, a funny adult colouring book with crayons. To further enhance your office or workspace, we&#8217;re including a 50 EURO Amazon gift card as part of the kit. We have three of these exciting kits ready for our top participants. So buckle up and let your creative juices flow!</p>



<p>At the heart of this challenge lies a model-in-production topic. We teased this during recent ML conferences and now, it’s time for action. The challenge requires the innovative use of a Siamese Network solution to work with small data sets. Entries will be judged based on the quality and appropriateness of the submission. The finer details are provided below.</p>



<p>And that&#8217;s not all &#8211; all participants who solve the challenge correctly will be awarded certificates.</p>



<p>But wait! We&#8217;ve got a bonus round. A side challenge that promises more thrill and more creativity. Stay tuned to our emails and social media channels, specifically LinkedIn, for the announcement.</p>



<p>A little birdie told us that those who participate in this challenge will have a head start for our upcoming autumn challenge. So, we highly recommend that you keep your work saved.</p>



<h2 class="wp-block-heading">The how&#8217;s and when&#8217;s </h2>



<p><strong>The challenge instructions:</strong> Your task, &#8216;Authorship Comparison&#8217;, is simple but thought-provoking: create a smart solution to determine if two texts share the same author. Rather than developing a massive model, we want to see how well you can optimise a limited dataset. The submissions will be assessed based on the scores and the approaches taken. </p>



<p>All submissions must be through the Kaggle submission system, and if you&#8217;re not a member yet, creating a login is easy and free. Check out the detailed instructions and resources on the competition page <a href="https://www.kaggle.com/competitions/authorship-comparison/overview">here</a>. </p>



<p><strong>Deadlines:</strong> The Authorship Comparison challenge opens on the 21st of July 2023. We strongly recommend that participants start no later than the 11th of September 2023, to leave sufficient time to develop and fine-tune their model.</p>



<p><strong>Important Update:</strong> In response to the feedback from our enthusiastic participants, the submission deadline for the Summer Challenge has been extended. All entries will now be accepted until <strong>16th of September, 2023</strong>, 23:59 CEST. This provides additional time for both new participants to join and for existing participants to refine their submissions. We wish everyone the best of luck! No submissions will be accepted after this time, so ensure your entry is submitted promptly.</p>



<p>For beginners and those looking for extra support, we&#8217;ve provided tips and resources linked within the challenge on Kaggle. These could be very helpful, so make sure to take a look. But remember, going through these resources might take some time in addition to the time needed for the challenge task itself.</p>



<p>Get started now, and best of luck to you all!</p>



<p><strong>The Fine Print:</strong> <a href="https://tantusdata.com/insights/the-tantusdata-summer-challenge-terms-and-conditions/">Here&#8217;s</a> the link to all the terms and conditions of the challenge. The basic rules of participation are also listed on the challenge tabs in Kaggle, which will go live tomorrow, so you can view them there too.</p>



<p>Get ready to compete, learn, and win. And remember, the clock is ticking. Keep an eye out for our announcements!</p>



<p>Good luck, challengers!</p>



<p>#TantusDataSummerChallenge</p>
<p>The post <a href="https://tantusdata.com/insights/the-tantusdata-summer-challenge/">The Summer Challenge</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>J On the Beach x TantusData</title>
		<link>https://tantusdata.com/insights/j-on-the-beach-x-tantusdata/</link>
		
		<dc:creator><![CDATA[TantusData]]></dc:creator>
		<pubDate>Mon, 17 Apr 2023 12:39:43 +0000</pubDate>
				<category><![CDATA[Big data technologies]]></category>
		<category><![CDATA[Conference]]></category>
		<category><![CDATA[JOTB23]]></category>
		<guid isPermaLink="false">https://tantusdata.com/?post_type=insights&#038;p=1452</guid>

					<description><![CDATA[<p>J On The Beach: Where Developers and Data Scientists Meet to learn and unlock the potential of Big Data Technologies EDUCATE, SHARE, SPREAD, AND LEARN The technical conference brings together developers, data scientists, and DevOps professionals from around the world to explore the latest trends in big data technologies. This immersive event features workshops, hackathons, [&#8230;]</p>
<p>The post <a href="https://tantusdata.com/insights/j-on-the-beach-x-tantusdata/">J On the Beach x TantusData</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="578" src="https://tantusdata.com/app/uploads/2023/04/TantusData_with_JOntheBeach23_May2023-1024x578.jpg" alt="J on The Beach 2023 conference and TantusData" class="wp-image-1455" srcset="https://tantusdata.com/app/uploads/2023/04/TantusData_with_JOntheBeach23_May2023-1024x578.jpg 1024w, https://tantusdata.com/app/uploads/2023/04/TantusData_with_JOntheBeach23_May2023-300x169.jpg 300w, https://tantusdata.com/app/uploads/2023/04/TantusData_with_JOntheBeach23_May2023-768x433.jpg 768w, https://tantusdata.com/app/uploads/2023/04/TantusData_with_JOntheBeach23_May2023-1536x867.jpg 1536w, https://tantusdata.com/app/uploads/2023/04/TantusData_with_JOntheBeach23_May2023.jpg 1914w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>J On The Beach: Where Developers and Data Scientists Meet to learn and unlock the potential of Big Data Technologies</p>



<h2 class="wp-block-heading"><strong>EDUCATE, SHARE, SPREAD, AND LEARN</strong></h2>



<p>The technical conference brings together developers, data scientists, and DevOps professionals from around the world to explore the latest trends in big data technologies. This immersive event features workshops, hackathons, and technical talks led by top speakers in the field. Attendees will learn about a range of topics, from data collection and stream processing to machine learning, microservices, artificial intelligence, container systems, and more.</p>



<p>You can read all about this year&#8217;s edition <a href="https://www.jonthebeach.com">here</a>.</p>



<p>From the 10th to the 12th of May, fantastic international speakers will take the stage to share their top tips and tricks.</p>



<p>Our own&nbsp;<a href="https://www.linkedin.com/in/ACoAAATwupEBSS-9R8o7J9xB98KPDt2eTyWjCG0">Marcin Szymaniuk</a>&nbsp;will share his best practice takeaways on the 12th of May.&nbsp;You can find more about the topic our expert will present <a href="https://lnkd.in/dZyfyAtA">here</a>.</p>



<p><br>Did we mention that it just happens to be Friday and the conference will conclude with a great party in the evening?</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="535" src="https://tantusdata.com/app/uploads/2023/04/TantusData_data.science_talk_JOTB23-1024x535.jpg" alt="Marcin Szymaniuk from Tantus Data speaking at J on The Beach" class="wp-image-1453" srcset="https://tantusdata.com/app/uploads/2023/04/TantusData_data.science_talk_JOTB23-1024x535.jpg 1024w, https://tantusdata.com/app/uploads/2023/04/TantusData_data.science_talk_JOTB23-300x157.jpg 300w, https://tantusdata.com/app/uploads/2023/04/TantusData_data.science_talk_JOTB23-768x401.jpg 768w, https://tantusdata.com/app/uploads/2023/04/TantusData_data.science_talk_JOTB23.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure></div>


<p></p>



<p>So make sure to book your place and join us in gorgeous Málaga for invaluable insights and strategies you won&#8217;t learn elsewhere.</p>
<p>The post <a href="https://tantusdata.com/insights/j-on-the-beach-x-tantusdata/">J On the Beach x TantusData</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>TantusData, the game changer by Clutch</title>
		<link>https://tantusdata.com/insights/tantusdata-a-leading-service-providers-clutch/</link>
		
		<dc:creator><![CDATA[TantusData]]></dc:creator>
		<pubDate>Wed, 22 Mar 2023 14:31:21 +0000</pubDate>
				<category><![CDATA[Clutch]]></category>
		<category><![CDATA[reviews]]></category>
		<guid isPermaLink="false">http://tantusdata.local/?post_type=insights&#038;p=568</guid>

					<description><![CDATA[<p>Clutch has recently brought to our attention that we are listed as a leading service provider on Clutch; naming us as one of Poland-headquartered industry game-changer big data analytics firms. For those few who haven’t heard of Clutch yet, it is a B2B ratings and reviews platform based in Washington, DC. The company evaluates technology [&#8230;]</p>
<p>The post <a href="https://tantusdata.com/insights/tantusdata-a-leading-service-providers-clutch/">TantusData, the game changer by Clutch</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="438" src="https://tantusdata.com/app/uploads/2023/01/TantusData_header_clutch-1024x438.jpg" alt="TantusData as the leading service provider" class="wp-image-1275" srcset="https://tantusdata.com/app/uploads/2023/01/TantusData_header_clutch-1024x438.jpg 1024w, https://tantusdata.com/app/uploads/2023/01/TantusData_header_clutch-300x128.jpg 300w, https://tantusdata.com/app/uploads/2023/01/TantusData_header_clutch-768x328.jpg 768w, https://tantusdata.com/app/uploads/2023/01/TantusData_header_clutch.jpg 1520w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<h2 class="wp-block-heading">Clutch has recently brought to our attention that we are <a href="https://clutch.co/pl/it-services/analytics">listed as</a> a leading service provider on Clutch, naming us one of the game-changing big data analytics firms headquartered in Poland.</h2>



<p style="font-size:18px">For those few who haven’t heard of Clutch yet, it is a B2B ratings and reviews platform based in Washington, DC. The company evaluates technology service and solutions companies based on the quality of work, thought leadership, and other key highlights from clients’ reviews.</p>



<p style="font-size:18px">We are a data company specialising in big data solutions. We excel at data platforms, data-driven systems, analytics, and advanced machine learning. Our services range from data infrastructure and cloud optimisation to ELT and re-engineering mission-critical systems. You can see a selection of our case studies at Clutch in the portfolio section.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="535" src="https://tantusdata.com/app/uploads/2023/01/TantusData_clutch2-1024x535.jpg" alt="Case study of TantusData on the Clutch profile." class="wp-image-1278" srcset="https://tantusdata.com/app/uploads/2023/01/TantusData_clutch2-1024x535.jpg 1024w, https://tantusdata.com/app/uploads/2023/01/TantusData_clutch2-300x157.jpg 300w, https://tantusdata.com/app/uploads/2023/01/TantusData_clutch2-768x401.jpg 768w, https://tantusdata.com/app/uploads/2023/01/TantusData_clutch2.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<h2 class="wp-block-heading">Let&#8217;s hear it from the important ones</h2>



<p>Our clients’ honest and thorough reviews on Clutch have a tremendous impact on us. Their feedback helps us improve our services and establish our name in the industry. It also shows that we deliver in a manner that fits our values. Today, we will share some snippets of the reviews we have received so far.</p>



<p><em>“We chose TantusData because of how flexible they were in the way of delivery. It was important for us to have their expert working alongside our internal team full time.” – Principal Engineer, UK’s fastest growing online accommodation marketplaces.</em></p>



<p>This doesn’t come as a surprise. After all, flexibility is at our core. We start by focusing on you, meeting you where and when you need it, and bringing a solution that best fits your company and situation. We guarantee the results: what we promise, we deliver. Finally, from start to finish, your project is handled only by experts with long-standing experience in the industry.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="620" src="https://tantusdata.com/app/uploads/2023/03/TantusData_Clutch-1024x620.jpg" alt="The reviews on Clutch for TantusData" class="wp-image-1285" srcset="https://tantusdata.com/app/uploads/2023/03/TantusData_Clutch-1024x620.jpg 1024w, https://tantusdata.com/app/uploads/2023/03/TantusData_Clutch-300x182.jpg 300w, https://tantusdata.com/app/uploads/2023/03/TantusData_Clutch-768x465.jpg 768w, https://tantusdata.com/app/uploads/2023/03/TantusData_Clutch-1536x930.jpg 1536w, https://tantusdata.com/app/uploads/2023/03/TantusData_Clutch-2048x1240.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p><em>“The onboarding and collaboration was smooth since both technical and communication skills of the team were very good”. – </em>Director of Data &amp; Advanced Analytics, large scale retail company</p>



<p>We speak your language<strong> </strong>and ensure that the problem is not only solved but also in a way and at a pace convenient to you. Whether you have a long-term project, need extra hands on board mid-delivery, or need some crucial last-minute help — we’re quick to respond.&nbsp;</p>



<p><em>“Effective communication and very quick turnaround when unexpected problems arise.” – CTO, 360 Restaurant Management System provider</em></p>



<script type="text/javascript" src="https://widget.clutch.co/static/js/widget.js"></script> <div class="clutch-widget" data-url="https://widget.clutch.co" data-widget-type="5" data-height="auto" data-nofollow="true" data-expandifr="true" data-scale="100" data-header-color="#17313B" data-footer-color="#17313B" data-scale="100" data-primary-color="#08537E" data-secondary-color="#08537E" data-clutchcompany-id="1972193"></div>



<p></p>



<p><em>“One of the key resources of the company is their unique skill set. Such a mix of data science and engineering coupled with clear understanding of rules of GDPR and ability to tune performance is hard to find on the market.” – Head of Data Science, one of the largest telecom companies in Europe</em></p>



<p>We are a flexible, agile, and unequaled group of experts. This means we can address any situation and provide the best solutions, 100% tailored and effective long-term. So, it’s not about the new and popular solutions but the ones that will serve you best.</p>



<p><em>“I appreciated that they investigated several solutions and delivered them to us with an explanation. They genuinely cared about the outcomes of the models they built and stayed actively involved throughout split testing on live environments.” – Principal Engineer, UK’s fastest growing online accommodation marketplaces.</em></p>



<p>At the same time, TantusData ensures you have all the knowledge you need straight from our specialized team, so that you can fully leverage the solution to create value and confidently make decisions regarding data.<br><em>“We could see that they really cared about meeting our needs, but most importantly about the project long term, that it goes well.”</em> – CTA, Depict.ai</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="535" src="https://tantusdata.com/app/uploads/2023/01/TantusData_clutch1-1024x535.jpg" alt="TantusData portfolio on the Clutch platform" class="wp-image-1280" srcset="https://tantusdata.com/app/uploads/2023/01/TantusData_clutch1-1024x535.jpg 1024w, https://tantusdata.com/app/uploads/2023/01/TantusData_clutch1-300x157.jpg 300w, https://tantusdata.com/app/uploads/2023/01/TantusData_clutch1-768x401.jpg 768w, https://tantusdata.com/app/uploads/2023/01/TantusData_clutch1.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<h2 class="wp-block-heading">Our badges</h2>



<p>The whole team at TantusData is proud to be the recipients of the special Clutch Badges for the top providers. To be sure, we will continue to go the extra mile, no matter the project or business size. We are delighted to see our clients appreciate our hard work. If you’re curious to see more about the reviews, check out our <a href="https://clutch.co/profile/tantusdata#summary">Clutch profile</a>. There you can also view selected projects from our portfolio.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="535" src="https://tantusdata.com/app/uploads/2023/03/TantusData_clutch_nagrody-1024x535.jpg" alt="Clutch badges TantusData has received." class="wp-image-1282" srcset="https://tantusdata.com/app/uploads/2023/03/TantusData_clutch_nagrody-1024x535.jpg 1024w, https://tantusdata.com/app/uploads/2023/03/TantusData_clutch_nagrody-300x157.jpg 300w, https://tantusdata.com/app/uploads/2023/03/TantusData_clutch_nagrody-768x401.jpg 768w, https://tantusdata.com/app/uploads/2023/03/TantusData_clutch_nagrody.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>We help our clients get up to speed with taking advantage of their data. <a href="https://tantusdata.com/#contact">Contact us</a> now!</p>
<p>The post <a href="https://tantusdata.com/insights/tantusdata-a-leading-service-providers-clutch/">TantusData, the game changer by Clutch</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>SmartData conference 2021</title>
		<link>https://tantusdata.com/insights/smart-data-conference-2021/</link>
		
		<dc:creator><![CDATA[Maryna Dubavets]]></dc:creator>
		<pubDate>Tue, 22 Feb 2022 09:59:00 +0000</pubDate>
				<category><![CDATA[airflow]]></category>
		<category><![CDATA[big data events]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[SmartData conference]]></category>
		<guid isPermaLink="false">http://tantusdata.local/?post_type=insights&#038;p=580</guid>

					<description><![CDATA[<p>The SmartData online conference organized by the JUG.ru committee from Russia took place on Oct 11-14. It was full of interesting presentations and discussions going in 3 threads in parallel. I&#8217;d like to briefly state here some of the topics presented. Apache Airflow 2.3 and beyond: What comes next? By Ash Berlin-Taylor from Astronomer.io Ash [&#8230;]</p>
<p>The post <a href="https://tantusdata.com/insights/smart-data-conference-2021/">SmartData conference 2021</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="576" src="https://tantusdata.com/app/uploads/2023/03/Smart_data_conference-1024x576.jpg" alt="Conferences and meetings" class="wp-image-1291" srcset="https://tantusdata.com/app/uploads/2023/03/Smart_data_conference-1024x576.jpg 1024w, https://tantusdata.com/app/uploads/2023/03/Smart_data_conference-300x169.jpg 300w, https://tantusdata.com/app/uploads/2023/03/Smart_data_conference-768x432.jpg 768w, https://tantusdata.com/app/uploads/2023/03/Smart_data_conference-1536x863.jpg 1536w, https://tantusdata.com/app/uploads/2023/03/Smart_data_conference-2048x1151.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p style="font-size:18px">The SmartData online conference, organized by the JUG.ru committee from Russia, took place on Oct 11-14. It was full of interesting presentations and discussions running in 3 parallel tracks. I&#8217;d like to briefly summarize some of the topics presented.</p>



<h2 class="wp-block-heading">Apache Airflow 2.3 and beyond: What comes next? by Ash Berlin-Taylor from Astronomer.io</h2>



<p>Ash has been a contributor to Airflow for almost four years. He was the Release Manager for the majority of the 1.10 release series, and he also rewrote much of the Scheduler internals to make it highly available and to increase performance by an order of magnitude. Outside of Airflow he is the Director of Airflow Engineering at Astronomer.io, where he runs the team of developers contributing to the open source Airflow project.</p>



<p>Ash started by mentioning recent Airflow achievements, such as the 10-100x performance speed-up in Airflow 2.0, released in December 2020. Airflow 2.2, which was about to be released at the time of the talk, contains the following updates:</p>



<p>i. AIP-39: Run DAGs on customisable schedules. The confusing &#8220;execution_date&#8221; is deprecated; &#8220;logical_date&#8221;, &#8220;data_interval_start&#8221; and &#8220;data_interval_end&#8221; are introduced instead.</p>



<p>ii. AIP-40: Any operator can &#8220;defer&#8221; itself. Deferrable (async) operators are a generalization of the smart sensors introduced earlier. They help avoid wasting resources while waiting for an external dependency to complete.</p>
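


<p>To make the AIP-39 naming more concrete, here is a minimal plain-Python sketch of how the three values relate for a fixed-period schedule. It does not use Airflow itself: the <code>RunInterval</code> class and the <code>interval_for</code> helper are hypothetical names introduced only for illustration, and the sketch assumes a simple schedule where each run covers exactly one period.</p>

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class RunInterval:
    """Hypothetical stand-in for the values a scheduled run carries."""

    logical_date: datetime
    data_interval_start: datetime
    data_interval_end: datetime


def interval_for(logical_date: datetime, period: timedelta) -> RunInterval:
    # Assumption: for a fixed-period schedule, the run labelled with
    # `logical_date` covers the period that starts at that moment and
    # ends exactly one period later.
    return RunInterval(
        logical_date=logical_date,
        data_interval_start=logical_date,
        data_interval_end=logical_date + period,
    )


# A daily run labelled 2021-10-11 covers the data of that whole day.
run = interval_for(datetime(2021, 10, 11, tzinfo=timezone.utc), timedelta(days=1))
print(run.data_interval_start.isoformat(), run.data_interval_end.isoformat())
```

<p>Naming both ends of the interval explicitly removes the guesswork about whether a single date refers to the start of the processed period or to the moment the run was launched.</p>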



<p>The roadmap for version 2.3 was not yet finalised, so Ash presented his own vision of Airflow&#8217;s possible future.</p>



<p>1. DAGs should be a joy to write and read, making Airflow the orchestrator of choice for any workflow. Make it easier to operate confidently.</p>



<p>2. Dynamic DAGs. The mapped task concept could make it possible to launch as many copies of an operator as needed, like map tasks, to process all files in parallel. A parametrized DAG defined once could be used with different parameters.</p>



<p>3. Get rid of the UNIQUE constraint on the execution_date value to allow scheduling multiple DAG runs at the same time.</p>



<p>4. Introduce airflowctl: CLI over REST API.</p>



<p>5. Solve the &#8220;untrusted worker&#8221; problem: set access control per connection.</p>



<p>6. Make it easier to assign lifecycle hooks and DAG notifications, such as sending a Slack notification on failure.</p>



<p>7. A better cross-DAG story: event-triggered DAGs and the introduction of a Data object concept. It should be possible to bind a storage folder to a Data object reference and assign a hook that triggers task execution when the content changes. When DAG execution completes, all temporary files could be cleaned up automatically.</p>
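


<p>The mapped task idea from item 2 can be sketched in plain Python. This is not Airflow&#8217;s API, only an illustration of the fan-out pattern under simple assumptions: <code>list_files</code>, <code>process</code> and <code>run_mapped</code> are hypothetical functions, and a thread pool stands in for the workers that would run the task copies.</p>

```python
from concurrent.futures import ThreadPoolExecutor


def list_files() -> list[str]:
    # Hypothetical upstream task: the number of files is known only at run time.
    return ["part-0.csv", "part-1.csv", "part-2.csv"]


def process(path: str) -> int:
    # Hypothetical per-file work; here it only returns the length of the name.
    return len(path)


def run_mapped(paths: list[str]) -> list[int]:
    # One copy of `process` per input, executed in parallel.
    # `pool.map` returns the results in the same order as the inputs.
    with ThreadPoolExecutor(max_workers=4) as pool:
        return list(pool.map(process, paths))


results = run_mapped(list_files())
total = sum(results)  # a "reduce" step that runs after all copies have finished
print(results, total)
```

<p>The point of the pattern is that the workflow definition stays the same whether the upstream step returns three files or three thousand.</p>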



<p>Ideas for the even more distant future included the following:</p>



<p>1. Add DAG versioning and show the historical state of a DAG in the UI. Versioning would also help simplify DAG deployments.</p>



<p>2. Streaming support and better support for micro-batch and long-running batch jobs.</p>



<p>3. Support and integration with Machine Learning tools, model hosting, comparing models.</p>



<h2 class="wp-block-heading">Big Data Tools presentation by Oleg Chiruhin from JetBrains</h2>



<p>Big Data Tools (BDT) is a plugin for IntelliJ IDEA that lets you work with Zeppelin notebooks, monitor Spark and Hadoop applications, and explore cloud storage systems.</p>



<p>I. File storage browser. Supported options include the local FS, HDFS, Google Storage, S3, Azure, Minio and some more. Once local and cloud storages are configured, files can be transferred in either direction. It&#8217;s also possible to preview a sample of a remote file with different view options: plain text (for CSV) or table view:</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="535" src="https://tantusdata.com/app/uploads/2023/01/TantusData_Table-view-of-.parquet-file-in-Big-Data-Tools-1024x535.jpg" alt="Table view of .parquet file in Big Data Tools" class="wp-image-1300" srcset="https://tantusdata.com/app/uploads/2023/01/TantusData_Table-view-of-.parquet-file-in-Big-Data-Tools-1024x535.jpg 1024w, https://tantusdata.com/app/uploads/2023/01/TantusData_Table-view-of-.parquet-file-in-Big-Data-Tools-300x157.jpg 300w, https://tantusdata.com/app/uploads/2023/01/TantusData_Table-view-of-.parquet-file-in-Big-Data-Tools-768x401.jpg 768w, https://tantusdata.com/app/uploads/2023/01/TantusData_Table-view-of-.parquet-file-in-Big-Data-Tools.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>II. Zeppelin notebook editor with SSH tunnelling for accessing remote notebooks within a restricted network.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="535" src="https://tantusdata.com/app/uploads/2023/01/TantusData_Ssh-tunneling-configuration-for-Zeppelin-connection-1024x535.jpg" alt="SSH tunneling configuration for Zeppelin connection" class="wp-image-1298" srcset="https://tantusdata.com/app/uploads/2023/01/TantusData_Ssh-tunneling-configuration-for-Zeppelin-connection-1024x535.jpg 1024w, https://tantusdata.com/app/uploads/2023/01/TantusData_Ssh-tunneling-configuration-for-Zeppelin-connection-300x157.jpg 300w, https://tantusdata.com/app/uploads/2023/01/TantusData_Ssh-tunneling-configuration-for-Zeppelin-connection-768x401.jpg 768w, https://tantusdata.com/app/uploads/2023/01/TantusData_Ssh-tunneling-configuration-for-Zeppelin-connection.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>BDT adds rich Scala autocompletion, which is missing in the Zeppelin web interface. External modules and JARs can also be added for autocompletion and in-place documentation, so it is not limited to the standard library.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="535" src="https://tantusdata.com/app/uploads/2023/01/TantusData_Autocompletion_in_BDT-1024x535.jpg" alt="Autocompletion in BDT" class="wp-image-1302" srcset="https://tantusdata.com/app/uploads/2023/01/TantusData_Autocompletion_in_BDT-1024x535.jpg 1024w, https://tantusdata.com/app/uploads/2023/01/TantusData_Autocompletion_in_BDT-300x157.jpg 300w, https://tantusdata.com/app/uploads/2023/01/TantusData_Autocompletion_in_BDT-768x401.jpg 768w, https://tantusdata.com/app/uploads/2023/01/TantusData_Autocompletion_in_BDT.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>Navigation to code declarations and other IDE features are supported as well.</p>



<p>III. Markdown support</p>



<p>IV. Easy imports</p>



<p>V. Intermediate type visibility</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="535" src="https://tantusdata.com/app/uploads/2023/01/TantusData_returning_types_in_grey-1024x535.jpg" alt="Returning types in grey" class="wp-image-1308" srcset="https://tantusdata.com/app/uploads/2023/01/TantusData_returning_types_in_grey-1024x535.jpg 1024w, https://tantusdata.com/app/uploads/2023/01/TantusData_returning_types_in_grey-300x157.jpg 300w, https://tantusdata.com/app/uploads/2023/01/TantusData_returning_types_in_grey-768x401.jpg 768w, https://tantusdata.com/app/uploads/2023/01/TantusData_returning_types_in_grey.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>VI. Plot execution results</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="535" src="https://tantusdata.com/app/uploads/2023/01/TantusData_result_chart-1024x535.jpg" alt="Result chart" class="wp-image-1306" srcset="https://tantusdata.com/app/uploads/2023/01/TantusData_result_chart-1024x535.jpg 1024w, https://tantusdata.com/app/uploads/2023/01/TantusData_result_chart-300x157.jpg 300w, https://tantusdata.com/app/uploads/2023/01/TantusData_result_chart-768x401.jpg 768w, https://tantusdata.com/app/uploads/2023/01/TantusData_result_chart.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p>VII. Ability to monitor job execution </p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="535" src="https://tantusdata.com/app/uploads/2023/01/TantusData_Hadoop-monitoring-int-BDT-1024x535.jpg" alt="Hadoop monitoring in BDT" class="wp-image-1304" srcset="https://tantusdata.com/app/uploads/2023/01/TantusData_Hadoop-monitoring-int-BDT-1024x535.jpg 1024w, https://tantusdata.com/app/uploads/2023/01/TantusData_Hadoop-monitoring-int-BDT-300x157.jpg 300w, https://tantusdata.com/app/uploads/2023/01/TantusData_Hadoop-monitoring-int-BDT-768x401.jpg 768w, https://tantusdata.com/app/uploads/2023/01/TantusData_Hadoop-monitoring-int-BDT.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<h2 class="wp-block-heading">Hadoop 3: Erasure coding catastrophe by Denis Efarov from OK (Odnoklassniki)</h2>



<p>Denis Efarov, a lead developer at&nbsp;Odnoklassniki, has been working with big data since 2013. Since 2018 he has been designing and developing the platform for storing and processing data for the Odnoklassniki project.</p>



<p>Denis told an extremely exciting and dramatic story about a migration from Hadoop 2.7.3 to 3.1.4 and back, which cost the company more than a year of wasted work and around 115 TB of lost production data.</p>



<p>Hadoop 3 introduced erasure coding as an alternative to replication for protecting against data loss. This mechanism stores 3 additional parity units of Reed-Solomon codes for every 6 units of original data. It leads to only 50% storage overhead, compared to 200% for replication with a factor of 3, with a similar reliability guarantee. Of course, the encoding and decoding slow down reads and writes, but saving space was the priority: at OK the data volume was on the scale of tens of petabytes.</p>
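


<p>The 50% versus 200% figures follow from simple arithmetic, sketched below in Python. The function names are hypothetical, and the 10 PB volume is an assumed round number used only to make the difference tangible; it is not a figure from the talk.</p>

```python
def replication_overhead(factor: int) -> float:
    # Extra copies stored per unit of original data.
    return float(factor - 1)


def erasure_coding_overhead(data_units: int, parity_units: int) -> float:
    # Parity stored per unit of original data.
    return parity_units / data_units


def raw_storage_needed(logical_size: float, overhead: float) -> float:
    # Total disk space needed to hold `logical_size` of original data.
    return logical_size * (1 + overhead)


ec = erasure_coding_overhead(data_units=6, parity_units=3)  # 0.5, i.e. 50%
rep = replication_overhead(factor=3)                        # 2.0, i.e. 200%

logical_pb = 10.0  # assumed volume of original data, in petabytes
print(raw_storage_needed(logical_pb, ec), raw_storage_needed(logical_pb, rep))
```

<p>Under these assumptions the same data needs half as much raw disk space with erasure coding as with triple replication, which explains why the saving mattered at petabyte scale.</p>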



<p>The whole migration was expected to take 8-10 months, but after half a year the team noticed that some Parquet files were broken. At first glance the damage seemed to appear randomly, but after a deep analysis they detected an issue with parity block cleanup (see these links:&nbsp;<a href="https://issues.apache.org/jira/browse/HDFS-14768" target="_blank" rel="noreferrer noopener">https://issues.apache.org/jira/browse/HDFS-14768</a>&nbsp;and&nbsp;<a href="https://issues.apache.org/jira/browse/HDFS-15186" target="_blank" rel="noreferrer noopener">https://issues.apache.org/jira/browse/HDFS-15186</a>). By that time 2 out of 3 redundant clusters had already been migrated and 4,000,000 files were broken. 90% of them were restored from the remaining backup cluster. To restore the rest, the team developed a tricky decoding tool that tried to guess the original data from all possible byte combinations, guided by the Parquet file structure. This way about 9.9% of the files were restored. Approximately 40,000 production files (115 TB) were lost forever.</p>
<p>The post <a href="https://tantusdata.com/insights/smart-data-conference-2021/">SmartData conference 2021</a> appeared first on <a href="https://tantusdata.com">TantusData</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
