[Vaccination 2021] Apache Arrow: High-Performance Columnar Data Framework (Wes McKinney)
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing. With the aim to make the data ecosystem modular and connected, Wes will talk about Apache Arrow’s vision for a future more unified data analytics ecosystem. In this talk, Wes will discuss the underlying interfaces and protocols powering the project, trends in the Apache Arrow ecosystem, and work on Arrow-native query processing. We will also discuss the new Substrait initiative for portable logical query plans across physical execution backends.
This talk is part of the Vaccination Database (Second Dose) Tech Talk Seminar Series.
Wes McKinney is an open source software developer focusing on analytical computing. He created the Python pandas project and is a co-creator of Apache Arrow, his current focus. He authored two editions of the book Python for Data Analysis. Wes is a member of The Apache Software Foundation and also a PMC member for Apache Parquet. He is now the CTO of Voltron Data, a new startup working on accelerated computing technologies powered by Apache Arrow.
More Info: https://db.cs.cmu.edu/seminar2021-dose2#db13