The technicalities

obracajacy sie tekst
podwojna strzalka w dol
all articles
GDPR
right to be forgotten
data erasure
anonymisation
Big Data ouTDo 10 min read

GDPR, the forgotten done right

In the previous article, we covered anonymization and pseudonymization – techniques used in the context of ensuring data privacy, and more specifically, in the…

check out.
storage organisation
query performance
The technicalities 10 min read

How does storage organisation affect query performance?

Despite great efforts to separate interface from the implementation (like SQL), the pesky details always come up important when deploying to production, either when performance…

check out.
storage_organisation_vs_query_performance_TantusData
query performance
storage organisation
schema distribution
part 2
The technicalities 10 min read

Storage organisation vs query performance – examples

The article How does storage organisation affect query performance described a number of principles on how to model data in Amazon…

check out.
big_data_gdpr
GDPR
anonymisation
pseudonymisation
compliance
The technicalities 10 min read

Obtaining value from GDPR with solutions that work for your bottom line.

Compliance with the GDPR regulations can be profitable when done right. Apart from saving on legal fees and avoiding customer attrition, you can also…

check out.
AWS
cost optimisation
The technicalities 10 min read

How to waste money in the cloud

Expense optimization is often the main reason for migrating from on-premise to the cloud. The combination of pay-as-you-go and flexible provisioning reduces the problem of…

check out.
SPARK
repartition
The technicalities 10 min read

Spark shuffle – Case #3 – using salt in repartition

Why use salt in repartition? In the previous blog entry we saw how a skew in a processed dataset is affecting performance of Spark…

check out.