Category Archives: SQL-on-Hadoop

Lakehouse

I have just read the “Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics” paper and decided to write a short blog post going through some of the key points of the paper’s motivation. Let’s start.

Apache Spark Future

Everyone on the internet is constantly talking about the bright future of Apache Spark: how cool it is, how innovative it is, how fast it is moving, how big its community is, how large the investments in it are, and so on. But what is really hiding behind the enthusiasm of Spark adepts, and what is the real future of Apache Spark?

In this article I show you real data and real trends, trying to be as agnostic and unbiased as possible. This article is not affiliated with any vendor.

Apache HAWQ: Next Step in MPP

My first blog post has been accepted to the official Pivotal blog! Feel free to comment and share your opinion on the subject:

https://blog.pivotal.io/big-data-pivotal/products/apache-hawq-next-step-in-massively-parallel-processing

Modern Data Architecture Talk

Here is the video of my talk on Modern Data Architecture from Java Day Kiev 2015.

The slides are available here: Modern Data Architecture – JD Kiev v05

Spark Architecture Video

This is the talk I gave at Java Day Kiev 2015. It was a great conference after all.

MPP vs Hadoop Talk

Today I gave a talk at the Hadoop User Group Ireland meetup in Dublin. It went great, and it was an adapted and reworked version of my article on the same subject, MPP vs Hadoop. Here are the slides:

Feel free to comment and share your opinion on this subject.

Apache HAWQ Architecture Talk

I have finally translated my talk from the Highload++ 2015 conference in Moscow into English, so now you can enjoy fresh information about the Apache HAWQ internals!

If you’d like to download the slides, you can find them here: HAWQ Architecture HL++ 2015 Moscow

Spark Architecture Talk

Here are the slides for the talk I just gave at JavaDay Kiev about the architecture of Apache Spark and its internals, such as memory management and the shuffle implementation:

If you’d like to download the slides, you can find them here: Spark Architecture – JD Kiev v04

Modern Data Architecture

Here are the slides for the talk I just gave at JavaDay Kiev about modern data architecture and the different modern approaches to data processing:

If you’d like to download the slides, you can find them here: Modern Data Architecture – JD Kiev v05

Cloudera Kudu: Catching a Unicorn

Distributed Systems Architecture

brought to you by Alexey Grishchenko
