Events: impala

Events: impala

May 1

2017

04:45pm EDT
GHC 8102
May 1 2017
[DB Seminar] Spring 2017: Marcel Kornacker
Speaker:
Marcel Kornacker
System:
Impala

Running real-time data-intensive applications on Apache Hadoop requires complex architectures to store and query data, typically involving multiple independent systems that are tied together through custom-engineered pipelines. A common pattern is to use a NoSQL engine like Apache HBase for caching and later transformations, the results of which are periodically written to HDFS in one of the popular open columnar... Read More

Oct 22

2014

06:00pm EDT
Wean Hall 5302
Oct 22 2014
Impala: A Modern, Open-Source SQL Engine for Hadoop (Ippokratis Pandis)
Speaker:
Ippokratis Pandis
System:
Impala

The Cloudera Impala project is pioneering the next generation of Hadoop capabilities: the convergence of fast SQL queries with the capacity, scalability, and flexibility of a Hadoop cluster. With Impala, the academic and Hadoop communities now have an open-sourced codebase that helps query data stored in HDFS and Apache HBase in real time, using familiar SQL syntax. In contrast with... Read More

Oct 24

2014

03:00pm EDT
GHC 6115
Oct 24 2014
Eliminating Unscalable Communication in Transaction Processing, Toward Bionic Databases (Ippokratis Pandis)
Speaker:
Ippokratis Pandis
System:
Impala

On-line transaction processing (OLTP) is one of the two most important enterprise data management applications. Transaction processing workloads typically exhibit high concurrency and provide ample opportunities for parallel execution by multicore hardware. Unfortunately, due to the characteristics of the application, transaction processing systems must moderate and coordinate communication between independent agents. As a result, transaction processing systems cannot always convert... Read More