- Aerospike
- Alibaba
- Anna
- APOLLO
- Azure Cosmos DB
- BigQuery
- Bodo
- Cassandra
- Chroma
- ClickHouse
- Confluent
- CouchDB
- CrocodileDB
- DataFusion
- Datomic
- Debezium
- Dremio
- DuckDB
- EdgeDB
- Exon
- FASTER
- FeatureBase
- Firebolt
- FoundationDB
- Gel
- Google Spanner
- Greenplum
- HarperDB
- Impala
- Jepsen
- Kinetica
- LanceDB
- Litestream
- Malloy
- MariaDB
- MemSQL
- Modin
- MongoDB
- Napa
- NoisePage
- NuoDB
- OpenDAL
- OtterTune
- ParadeDB
- Pinot
- PostgresML
- PRQL
- QMDB
- QuestDB
- Redshift
- RisingWave
- Rockset
- rqlite
- Samza
- SingleStore
- SLOG
- Snowflake
- SpiceDB
- SplinterDB
- SQL Server
- SQLite
- Stardog
- Striim
- Swarm64
- Technical University of Munich
- TiDB
- TileDB
- Tokutek
- Umbra
- Vertica
- VoltDB
- WiredTiger
- YugabyteDB
- Akamas
- AlloyDB
- ApertureDB
- Arrow
- Berkeley DB
- BlazingDB
- Brytlyt
- Chaos Mesh
- Citus
- CockroachDB
- Convex
- CrateDB
- Databricks
- Datometry
- dbt
- Dolt
- Druid
- DVMS
- EraDB
- eXtremeDB
- Fauna
- Featureform
- Fluree
- Gaia
- GlareDB
- GoogleSQL
- GreptimeDB
- Heron
- InfluxDB
- kdb
- ksqlDB
- LeanStore
- LMDB
- MapD
- Materialize
- Milvus
- MonetDB
- MySQL
- Neon
- Noria
- OceanBase
- Oracle
- OxQL
- Pinecone
- PlanetScale
- PostgreSQL
- Qdrant
- QuasarDB
- RavenDB
- RelationalAI
- RocksDB
- RonDB
- SalesForce
- ScyllaDB
- sled
- Smooth
- Spice.ai
- Splice Machine
- SQL Anywhere
- SQLancer
- SQream
- StarRocks
- Summingbird
- Synnada
- TerminusDB
- TigerBeetle
- TimescaleDB
- Trino
- Velox
- Vitesse
- Weaviate
- Yellowbrick
- Aerospike
- AlloyDB
- APOLLO
- Berkeley DB
- Bodo
- Chaos Mesh
- ClickHouse
- Convex
- CrocodileDB
- Datometry
- Debezium
- Druid
- EdgeDB
- eXtremeDB
- FeatureBase
- Fluree
- Gel
- GoogleSQL
- HarperDB
- InfluxDB
- Kinetica
- LeanStore
- Malloy
- Materialize
- Modin
- MySQL
- NoisePage
- OceanBase
- OtterTune
- Pinecone
- PostgresML
- Qdrant
- QuestDB
- RelationalAI
- Rockset
- SalesForce
- SingleStore
- Smooth
- SpiceDB
- SQL Anywhere
- SQLite
- StarRocks
- Swarm64
- TerminusDB
- TileDB
- Trino
- Vertica
- Weaviate
- YugabyteDB
- Akamas
- Anna
- Arrow
- BigQuery
- Brytlyt
- Chroma
- CockroachDB
- CouchDB
- Databricks
- Datomic
- Dolt
- DuckDB
- EraDB
- FASTER
- Featureform
- FoundationDB
- GlareDB
- Greenplum
- Heron
- Jepsen
- ksqlDB
- Litestream
- MapD
- MemSQL
- MonetDB
- Napa
- Noria
- OpenDAL
- OxQL
- Pinot
- PostgreSQL
- QMDB
- RavenDB
- RisingWave
- RonDB
- Samza
- sled
- Snowflake
- Splice Machine
- SQL Server
- SQream
- Striim
- Synnada
- TiDB
- TimescaleDB
- Umbra
- Vitesse
- WiredTiger
- Alibaba
- ApertureDB
- Azure Cosmos DB
- BlazingDB
- Cassandra
- Citus
- Confluent
- CrateDB
- DataFusion
- dbt
- Dremio
- DVMS
- Exon
- Fauna
- Firebolt
- Gaia
- Google Spanner
- GreptimeDB
- Impala
- kdb
- LanceDB
- LMDB
- MariaDB
- Milvus
- MongoDB
- Neon
- NuoDB
- Oracle
- ParadeDB
- PlanetScale
- PRQL
- QuasarDB
- Redshift
- RocksDB
- rqlite
- ScyllaDB
- SLOG
- Spice.ai
- SplinterDB
- SQLancer
- Stardog
- Summingbird
- Technical University of Munich
- TigerBeetle
- Tokutek
- Velox
- VoltDB
- Yellowbrick
- Aerospike
- Anna
- Azure Cosmos DB
- Bodo
- Chroma
- Confluent
- CrocodileDB
- Datomic
- Dremio
- EdgeDB
- FASTER
- Firebolt
- Gel
- Greenplum
- Impala
- Kinetica
- Litestream
- MariaDB
- Modin
- Napa
- NuoDB
- OtterTune
- Pinot
- PRQL
- QuestDB
- RisingWave
- rqlite
- SingleStore
- Snowflake
- SplinterDB
- SQLite
- Striim
- Technical University of Munich
- TileDB
- Umbra
- VoltDB
- YugabyteDB
- Akamas
- ApertureDB
- Berkeley DB
- Brytlyt
- Citus
- Convex
- Databricks
- dbt
- Druid
- EraDB
- Fauna
- Fluree
- GlareDB
- GreptimeDB
- InfluxDB
- ksqlDB
- LMDB
- Materialize
- MonetDB
- Neon
- OceanBase
- OxQL
- PlanetScale
- Qdrant
- RavenDB
- RocksDB
- SalesForce
- sled
- Spice.ai
- SQL Anywhere
- SQream
- Summingbird
- TerminusDB
- TimescaleDB
- Velox
- Weaviate
- Alibaba
- APOLLO
- BigQuery
- Cassandra
- ClickHouse
- CouchDB
- DataFusion
- Debezium
- DuckDB
- Exon
- FeatureBase
- FoundationDB
- Google Spanner
- HarperDB
- Jepsen
- LanceDB
- Malloy
- MemSQL
- MongoDB
- NoisePage
- OpenDAL
- ParadeDB
- PostgresML
- QMDB
- Redshift
- Rockset
- Samza
- SLOG
- SpiceDB
- SQL Server
- Stardog
- Swarm64
- TiDB
- Tokutek
- Vertica
- WiredTiger
- AlloyDB
- Arrow
- BlazingDB
- Chaos Mesh
- CockroachDB
- CrateDB
- Datometry
- Dolt
- DVMS
- eXtremeDB
- Featureform
- Gaia
- GoogleSQL
- Heron
- kdb
- LeanStore
- MapD
- Milvus
- MySQL
- Noria
- Oracle
- Pinecone
- PostgreSQL
- QuasarDB
- RelationalAI
- RonDB
- ScyllaDB
- Smooth
- Splice Machine
- SQLancer
- StarRocks
- Synnada
- TigerBeetle
- Trino
- Vitesse
- Yellowbrick
Jan 21
2025
SplitSQL: Practical Pushdown Cache for DataLake Analytics (Xiangpeng Hao)
- Speaker:
- Xiangpeng Hao
- System:
- DataFusion
Modern data analytics embrace a disaggregated architecture which decouples storage, cache, and compute into network-connected independent components. With disaggregated cache, a key design decision is whether to push down query predicates to the cache server. Without predicate pushdown, the cache must send all data to compute nodes, creating network bottlenecks. With predicate pushdown, the cache server evaluates predicates on cached... Read More
Sep 30
2024
[Building Blocks] Accelerating Apache Spark workloads with Apache DataFusion Comet (Andy Grove)
- Speaker:
- Andy Grove
- System:
- DataFusion
- Video:
- YouTube
Apache Spark is one of the most widely-used distributed data analysis frameworks. However, its JVM-based and row-oriented query execution engine limits Spark’s performance and scalability. In this talk, we will introduce DataFusion Comet, an accelerator for Apache Spark designed to improve the efficiency of Spark queries by translating them into native queries that leverage Apache Arrow and Apache DataFusion. We... Read More
Sep 23
2024
[Building Blocks] Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine (Andrew Lamb)
- Speaker:
- Andrew Lamb
- System:
- DataFusion
- Video:
- YouTube
Apache DataFusion is a fast, embeddable, and extensible query engine written in Rust that uses Apache Arrow as its memory model. In this talk we explain DataFusion in more detail and describe the types of data centric systems it is used to build. We will also review its high level architecture and feature set, discussing tradeoffs and performance between DataFusion's... Read More