Organize your data

Make it easily accessible from all your devices, scalable, understandable and easy to analyse

More Info

Make your data rock

Model your data, interpret it and take appropriate action

More Info

Save your time

Automate your analysis and save time and money right now

More Info

Stat'Rgy news

Stat'Rgy in action

Latest Posts

Scala vs Python

If you are wondering whether you should learn Scala or Python… or both, you might want to read this. Scala is a statically typed language, which means that the type of each variable is known at compile time (either declared by the programmer or inferred by the compiler). Python, on the contrary, is […]

Continue reading

HBase response times

There are several causes of latency in HBase: normal network round-trip time (RTT), internal or external to the Hadoop cluster, on the order of milliseconds; disk read and write time, on the order of milliseconds; client retries due to moved regions or splits, on the order of seconds. HBase can move a region if it considers that the […]

Continue reading

HBase on a low-memory cluster

How to configure HBase on a low-memory cluster. Reduce the number of regions per server: before getting into the math, let’s briefly recall what the memstore is in HBase. The memstore holds in-memory modifications to the Store before they are flushed to the Store as an HFile; in other words, the data coming in is […]

Continue reading

Optimizing Hive queries

Tips for efficient Hive queries: Hive on Hadoop is a great data processing tool that is easy to use thanks to its SQL-like syntax. This article describes some tips for optimizing Hive queries. Typically there are three areas where you can optimize your Hive queries: data layout (partitions, buckets), data sampling, and data processing (map […]

Continue reading

Stat'Rgy works with all kinds of companies, from international groups to small and medium-sized businesses