In the previous article, we covered anonymization and pseudonymization – techniques used in the context of ensuring data privacy, and more specifically, in the…
Overview: Large Language Models (LLMs) can now use plug-ins to access extra tools. But they often respond slowly when these tools are used. This hurts…
Hands-on workshop on building GPT-powered apps with LangChain, Agents, embeddings, and vector databases. Learn to create accurate, real-time AI solutions.
Cloud Data Platform migrations come with hidden exit costs. Learn how to reduce vendor lock-in risk through smart architecture and technology choices.
Deploying open-source Large Language Models (LLMs) can be daunting. This guide covers licensing restrictions, trade-offs between accuracy, speed, and cost, benchmark evaluations, and deployment strategies, giving you the essentials for using open-source LLMs effectively in your projects.
Learn how to use the RDD API in Apache Spark to inspect partition details or perform low-level operations. Although largely superseded by the Dataset and DataFrame APIs, the RDD API remains accessible via the .rdd method on Datasets and DataFrames. Check the number of partitions with the getNumPartitions method and determine partition sizes using glom. Explore the other RDD operations that are still useful for low-level work and Spark internals.
Understanding Spark's .as[T] Method: Best Practices and Defensive Programming
Whether you’ve joined us in the past or are planning to attend our upcoming events, there’s always something exciting on the horizon. Let’s take a…
TantusData named a top B2B company for Qlik, Hadoop, Tableau, Big Data Compliance, Fraud, & Risk Management services.
A Strategic Overview for Decision Makers. Entering the realm of big data solutions marks a transformative step for any organisation, demanding a blend of strategic…
In the first article on Monitoring Airflow jobs with TIG, “System Metrics”, we saw an example of an Airflow installation with a TIG stack set…
Explore the critical importance of auditing your big data solutions to ensure efficiency, accuracy, and sustainability. Learn how regular audits can uncover hidden issues, optimise operations, and maintain data integrity in our comprehensive guide.
Like many server applications, Airflow can – and should – be monitored for metrics and logs. In this article, we will look into the former…