English

Presto: SQL-on-Anything

Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes, ranging from gigabytes to petabytes. Presto was designed and written from the ground up for interactive analytics and approaches the speed of commercial data warehouses while scaling to the size of organizations like Facebook. One key feature of Presto is the ability to query data where it lives via a uniform ANSI SQL interface. Presto’s connector architecture creates an abstraction layer for anything that can be represented in a columnar or row-like format, such as HDFS, Amazon S3, Azure Storage, NoSQL stores, relational databases, Kafka streams and even proprietary data stores. Furthermore, a single Presto query can combine data from multiple sources, allowing for analytics across an entire organization.

This talk will be co-presented by Wojciech Biela and Grzegorz Kokosiński from Starburst, the enterprise Presto company and the largest contributor to Presto outside of Facebook. The talk will be a gentle introduction to Presto and its ability to query virtually any data source via its connector interface. Wojciech and Grzegorz will also present some use cases of Presto querying various data sources and briefly describe the underlying architectural concepts.

Wojciech Biela
Director, Product Development, Starburst Data

Wojciech Biela is a co-founder of Starburst and is responsible for product development. He has over 13 years of experience building products and running engineering teams. Previously, Wojciech was the Engineering Manager at the Teradata Center for Hadoop, running the Presto engineering operations in Warsaw, Poland. Prior to that, back in 2011, he built and ran the Polish engineering team, a subsidiary of Hadapt Inc., a pioneer in the SQL-on-Hadoop space. Hadapt was acquired by Teradata in 2014. Earlier, Wojciech built and led teams on multi-year projects, from large custom e-commerce & SCM platforms to PoS systems. Wojciech holds an M.S. in Computer Science from the Wroclaw University of Technology.

Grzegorz Kokosiński
Experienced engineer, Founding member, Starburst Data

Grzegorz Kokosiński is an experienced engineer and a founding member of the Starburst team. He contributes to the development of significant Presto features such as the cost-based optimizer and correlated subqueries. Previously, at Teradata Labs, he also worked on Presto’s integration with the Teradata database. Grzegorz was a core developer at Hadapt before its acquisition by Teradata in 2014. Before that he worked at IBM/Netezza. He is a graduate of the Warsaw University of Technology.