Pinned Repositories
caesura
Language Models as Multi-Modal Query Planners
deepdb-public
Implementation of DeepDB: Learn from Data, not from Queries!
DFI-public
lcm-eval
This is the source code of the SIGMOD paper: "How Good are Learned Cost Models, Really? Insights From Query Optimization Tasks"
p4db
This is the source code for our (Matthias Jasny, Lasse Thostrup, Tobias Ziegler and Carsten Binnig) published paper at SIGMOD’22: P4DB - The Case for In-Network OLTP.
ParaphraseBench
A benchmark to test linguistic robustness.
RDMA_synchronization
This is the source code for our (Tobias Ziegler, Jacob Nelson-Slivon, Carsten Binnig and Viktor Leis) published paper at SIGMOD’23: Design Guidelines for Correct, Efficient, and Scalable Synchronization using One-Sided RDMA
ScaleStore
This is the source code for our (Tobias Ziegler, Carsten Binnig and Viktor Leis) published paper at SIGMOD’22: ScaleStore: A Fast and Cost-Efficient Storage Engine using DRAM, NVMe, and RDMA.
wannadb
WannaDB: Ad-hoc SQL Queries over Text Collections
zero-shot-cost-estimation
Implementation of our VLDB'22 paper "Zero-Shot Cost Models for Out-of-the-box Learned Cost Prediction"
DataManagementLab's Repositories
DataManagementLab/lcm-eval
This is the source code of the SIGMOD paper: "How Good are Learned Cost Models, Really? Insights From Query Optimization Tasks"
DataManagementLab/wannadb
WannaDB: Ad-hoc SQL Queries over Text Collections
DataManagementLab/wikidbs-public
DataManagementLab/eleet
Repository for ELEET: Efficient Learned Query Execution over Text and Tables
DataManagementLab/JOB-Complex
Sourcecode of our AIDB '25 paper "JOB-Complex: A Challenging Benchmark for Traditional & Learned Query Optimization"
DataManagementLab/Graceful
Sourcecode of our ICDE '25 paper "GRACEFUL: A Learned Cost Estimator For UDFs" (Johannes Wehrstein, Tiemo Bang, Roman Heinrich, Carsten Binnig)
DataManagementLab/llmeval-enterprise-challenges
Unveiling Challenges for LLMs in Enterprise Data Engineering
DataManagementLab/data_cleaning_with_latent_operators
Code repository for the Lopster paper on data cleaning.
DataManagementLab/demo-explain-lcms
Source Code for VLDB Demo Submission
DataManagementLab/justine-demo
Demo for JUSTINE, our system for self-organizing Schemas
DataManagementLab/ThesisPortal
Portal für Abschlussarbeiten - Bachelorpraktikum 2022/23
DataManagementLab/PIPE-X
This repository contains code for PIPE-X, a system providing impact measures of preprocessing steps.
DataManagementLab/Redbench
Sourcecode of the paper Redbench: Workload Synthesis From Cloud Traces
DataManagementLab/Redbench-Eval
Evaluation scripts for Redbench
DataManagementLab/reflex-smpc-analytics
Improving the Performance of Secure Data Analytics via Controlled Intermediate Result Size Disclosure
DataManagementLab/sgxv2-analytical-query-processing-benchmarks
DataManagementLab/llmeval-tada24
LLMs for Data Engineering on Enterprise Data
DataManagementLab/llmeval-trl24
Automating Enterprise Data Engineering with LLMs
DataManagementLab/A-Vision-for-Autonomous-Data-Agent-Collaboration
In this repository we will publish for our vision to go from Query-by-Integration to Query-by-Collaboration.
DataManagementLab/full-DBMS-in-SGX-experiments
Repository for the full DBMS experiments in our paper "Towards High-performance and Trusted Cloud DBMSs"
DataManagementLab/full-DBMS-in-SGX-Gramine
Fork of Gramine containing a fix for the memmgr performance bug
DataManagementLab/full-DBMS-in-SGX-Hyrise
Fork of Hyrise with minor changes for execution in SGX using Gramine
DataManagementLab/gui4de-lab25
Graphical User Interface for LLM-based Data Engineering
DataManagementLab/nitro-enclaves-benchmarks
DataManagementLab/otqa-dasp25
Towards Complex Table Question Answering Over Tabular Data Lakes (Extended Version)
DataManagementLab/PBench
DataManagementLab/redbench_hillmapper
Moderately-optimized hill climbing algorithm implementation to solve the scanset matching optimization problem relevant for Redbench.
DataManagementLab/Samsara
Repository for Samsara: Towards a Multimodal Stream Processing System
DataManagementLab/SPFlow
Sum Product Flow: An Easy and Extensible Library for Sum-Product Networks
DataManagementLab/trl_murban
Train transformer language models with reinforcement learning.