- Aerospike
- Alibaba
- Anna
- APOLLO
- Azure Cosmos DB
- BigQuery
- Bodo
- Cassandra
- Chroma
- ClickHouse
- Confluent
- CouchDB
- CrocodileDB
- DataFusion
- Datomic
- Debezium
- Dremio
- DuckDB
- EdgeDB
- Exon
- FASTER
- FeatureBase
- Feldera
- Fluree
- Gaia
- GlareDB
- GoogleSQL
- GreptimeDB
- Heron
- InfluxDB
- kdb
- ksqlDB
- LeanStore
- LMDB
- MapD
- Materialize
- Milvus
- MonetDB
- MySQL
- Neon
- Noria
- OceanBase
- Oracle
- OxQL
- Pinecone
- PlanetScale
- PostgreSQL
- Qdrant
- QuasarDB
- RavenDB
- RelationalAI
- RocksDB
- RonDB
- SalesForce
- ScyllaDB
- sled
- Smooth
- Spice.ai
- Splice Machine
- SQL Anywhere
- SQLancer
- SQream
- StarRocks
- Summingbird
- Synnada
- TerminusDB
- TigerBeetle
- TimescaleDB
- Trino
- Velox
- Vitesse
- Weaviate
- Yellowbrick
- Akamas
- AlloyDB
- ApertureDB
- Arrow
- Berkeley DB
- BlazingDB
- Brytlyt
- Chaos Mesh
- Citus
- CockroachDB
- Convex
- CrateDB
- Databricks
- Datometry
- dbt
- Dolt
- Druid
- DVMS
- EraDB
- eXtremeDB
- Fauna
- Featureform
- Firebolt
- FoundationDB
- Gel
- Google Spanner
- Greenplum
- HarperDB
- Impala
- Jepsen
- Kinetica
- LanceDB
- Litestream
- Malloy
- MariaDB
- MemSQL
- Modin
- MongoDB
- Napa
- NoisePage
- NuoDB
- OpenDAL
- OtterTune
- ParadeDB
- Pinot
- PostgresML
- PRQL
- QMDB
- QuestDB
- Redshift
- RisingWave
- Rockset
- rqlite
- Samza
- SingleStore
- SLOG
- Snowflake
- SpiceDB
- SplinterDB
- SQL Server
- SQLite
- Stardog
- Striim
- Swarm64
- Technical University of Munich
- TiDB
- TileDB
- Tokutek
- Umbra
- Vertica
- VoltDB
- WiredTiger
- YugabyteDB
- Aerospike
- AlloyDB
- APOLLO
- Berkeley DB
- Bodo
- Chaos Mesh
- ClickHouse
- Convex
- CrocodileDB
- Datometry
- Debezium
- Druid
- EdgeDB
- eXtremeDB
- FeatureBase
- Firebolt
- Gaia
- Google Spanner
- GreptimeDB
- Impala
- kdb
- LanceDB
- LMDB
- MariaDB
- Milvus
- MongoDB
- Neon
- NuoDB
- Oracle
- ParadeDB
- PlanetScale
- PRQL
- QuasarDB
- Redshift
- RocksDB
- rqlite
- ScyllaDB
- SLOG
- Spice.ai
- SplinterDB
- SQLancer
- Stardog
- Summingbird
- Technical University of Munich
- TigerBeetle
- Tokutek
- Velox
- VoltDB
- Yellowbrick
- Akamas
- Anna
- Arrow
- BigQuery
- Brytlyt
- Chroma
- CockroachDB
- CouchDB
- Databricks
- Datomic
- Dolt
- DuckDB
- EraDB
- FASTER
- Featureform
- Fluree
- Gel
- GoogleSQL
- HarperDB
- InfluxDB
- Kinetica
- LeanStore
- Malloy
- Materialize
- Modin
- MySQL
- NoisePage
- OceanBase
- OtterTune
- Pinecone
- PostgresML
- Qdrant
- QuestDB
- RelationalAI
- Rockset
- SalesForce
- SingleStore
- Smooth
- SpiceDB
- SQL Anywhere
- SQLite
- StarRocks
- Swarm64
- TerminusDB
- TileDB
- Trino
- Vertica
- Weaviate
- YugabyteDB
- Alibaba
- ApertureDB
- Azure Cosmos DB
- BlazingDB
- Cassandra
- Citus
- Confluent
- CrateDB
- DataFusion
- dbt
- Dremio
- DVMS
- Exon
- Fauna
- Feldera
- FoundationDB
- GlareDB
- Greenplum
- Heron
- Jepsen
- ksqlDB
- Litestream
- MapD
- MemSQL
- MonetDB
- Napa
- Noria
- OpenDAL
- OxQL
- Pinot
- PostgreSQL
- QMDB
- RavenDB
- RisingWave
- RonDB
- Samza
- sled
- Snowflake
- Splice Machine
- SQL Server
- SQream
- Striim
- Synnada
- TiDB
- TimescaleDB
- Umbra
- Vitesse
- WiredTiger
- Aerospike
- Anna
- Azure Cosmos DB
- Bodo
- Chroma
- Confluent
- CrocodileDB
- Datomic
- Dremio
- EdgeDB
- FASTER
- Feldera
- Gaia
- GoogleSQL
- Heron
- kdb
- LeanStore
- MapD
- Milvus
- MySQL
- Noria
- Oracle
- Pinecone
- PostgreSQL
- QuasarDB
- RelationalAI
- RonDB
- ScyllaDB
- Smooth
- Splice Machine
- SQLancer
- StarRocks
- Synnada
- TigerBeetle
- Trino
- Vitesse
- Yellowbrick
- Akamas
- ApertureDB
- Berkeley DB
- Brytlyt
- Citus
- Convex
- Databricks
- dbt
- Druid
- EraDB
- Fauna
- Firebolt
- Gel
- Greenplum
- Impala
- Kinetica
- Litestream
- MariaDB
- Modin
- Napa
- NuoDB
- OtterTune
- Pinot
- PRQL
- QuestDB
- RisingWave
- rqlite
- SingleStore
- Snowflake
- SplinterDB
- SQLite
- Striim
- Technical University of Munich
- TileDB
- Umbra
- VoltDB
- YugabyteDB
- Alibaba
- APOLLO
- BigQuery
- Cassandra
- ClickHouse
- CouchDB
- DataFusion
- Debezium
- DuckDB
- Exon
- FeatureBase
- Fluree
- GlareDB
- GreptimeDB
- InfluxDB
- ksqlDB
- LMDB
- Materialize
- MonetDB
- Neon
- OceanBase
- OxQL
- PlanetScale
- Qdrant
- RavenDB
- RocksDB
- SalesForce
- sled
- Spice.ai
- SQL Anywhere
- SQream
- Summingbird
- TerminusDB
- TimescaleDB
- Velox
- Weaviate
- AlloyDB
- Arrow
- BlazingDB
- Chaos Mesh
- CockroachDB
- CrateDB
- Datometry
- Dolt
- DVMS
- eXtremeDB
- Featureform
- FoundationDB
- Google Spanner
- HarperDB
- Jepsen
- LanceDB
- Malloy
- MemSQL
- MongoDB
- NoisePage
- OpenDAL
- ParadeDB
- PostgresML
- QMDB
- Redshift
- Rockset
- Samza
- SLOG
- SpiceDB
- SQL Server
- Stardog
- Swarm64
- TiDB
- Tokutek
- Vertica
- WiredTiger
May 5
2017
Miguel Araújo (Thesis defense dry-run)
- Speaker:
- Miguel Araújo
The identification of anomalies and communities of nodes in real-world graphs has applications in widespread domains, from the automatic categorization of wikipedia articles or websites to bank fraud detection. While recent and ongoing research is supplying tools for the analysis of simple unlabeled data, it is still a challenge to find patterns and anomalies in large labeled datasets such as time evolving networks. What do... Read More
May 2
2017
Agma Traina and Caetano Traina (University of São Paulo)
- Speaker:
- Agma Traina and Caetano Traina
The evolution of the Relational Database Management Systems must include not only resources to handle big data, but also complex data (such as images, audios, videos, graphs, multidimensional data, long texts, time series, genetic sequences, etc.), where order-based comparisons are not appropriate, and identity-based comparisons are meaningless. Comparing complex data by similarity stirrers much more meaning from data. However, current... Read More
May 1
2017
[DB Seminar] Spring 2017: Marcel Kornacker
- Speaker:
- Marcel Kornacker
- System:
- Impala
Running real-time data-intensive applications on Apache Hadoop requires complex architectures to store and query data, typically involving multiple independent systems that are tied together through custom-engineered pipelines. A common pattern is to use a NoSQL engine like Apache HBase for caching and later transformations, the results of which are periodically written to HDFS in one of the popular open columnar... Read More
Apr 25
2017
Dhivya Eswaran and Zongge Liu (SDM2017 dry run)
- Speaker:
- Dhivya Eswaran and Zongge Liu
Dhivya and Zongge will have dry runs for SDM 2017. Dhivya's talk information: Title: The Power of Certainty: A Dirichlet Multinomial Model for Belief Propagation Abstract: Given a friendship network, how certain are we that Smith is a progressive (vs. conservative)? How can we propagate these certainties through the network? While Belief propagation marked the beginning of principled label propagation to classify... Read More
Apr 24
2017
[DB Seminar] Spring 2017: Dana Van Aken
- Speaker:
- Dana Van Aken
- System:
- OtterTune
Database management system (DBMS) configuration tuning is an essential aspect of any data-intensive application effort. But this is historically a difficult task because DBMSs have hundreds of configuration "knobs" that control everything in the system, such as the amount of memory to use for caches and how often data is written to storage. The problem with these knobs is that... Read More
Apr 17
2017
[DB Seminar] Spring 2017: Hyeontaek Lim
- Speaker:
- Hyeontaek Lim
Multi-core in-memory databases promise high-speed online transaction processing. However, the performance of individual designs suffers when the workload characteristics miss their small sweet spot of a desired contention level, read-write ratio, record size, processing rate, and so forth. Cicada is a single-node multi-core in-memory transactional database with serializability. To provide high performance under diverse workloads, Cicada reduces overhead and contention... Read More
Apr 10
2017
[DB Seminar] Spring 2017: Mohammad Hammoud
- Speaker:
- Mohammad Hammoud
Relational join is a fundamental data management operation, which highly influences the performance of almost every database query. In this talk, I will show that different workload characteristics and hardware configurations necessitate different main-memory hash join models. Subsequently, I will identify four effective models by which any hash-based join algorithm can be executed. I will characterize the relative merits of... Read More
Apr 6
2017
[PDL/SDI/ISTC] Derek Murray (Google)
- Speaker:
- Derek Murray
TensorFlow is an open-source machine learning system, originally developed by the Google Brain team, which operates at large scale and in heterogeneous environments. TensorFlow trains and executes a variety of machine learning models at Google, including deep neural networks for image recognition and machine translation. The system uses dataflow graphs to represent stateful computations, and achieves high performance by mapping... Read More
Apr 3
2017
[DB Seminar] Spring 2017: Prashanth Menon
- Speaker:
- Prashanth Menon
In-memory database management systems (DBMSs) are a key component of modern on-line analytic processing (OLAP) applications, since they provide low-latency access to large volumes of data. Because disk accesses are no longer the principle bottleneck in such systems, the focus in designing query execution engines has shifted to optimizing CPU performance. Recent systems have revived an older technique of using... Read More
Mar 27
2017
[DB Seminar] Spring 2017: Viktor Leis
- Speaker:
- Viktor Leis
Managing data sets that are larger than RAM has always been one of the most important tasks for database systems. Traditional systems cache fixed-size pages in an in-memory buffer pool that has complete knowledge of all page accesses and transparently loads/evicts pages from/to disk. While this approach is effective at minimizing the number of I/O operations, it is also one... Read More