Classic papers and papers from the last three years on each topic

DBPapers

Classic or recent papers on each topic

Contributed by Song Bian (The Chinese University of Hong Kong)

Content

  1. DB + ML

  2. Data Exploration and Integration

  3. Graph Database

  4. Query Processing and Optimization

  5. New Hardware

  6. Stream & Sketch

  7. Security and Privacy

  8. Transaction and Concurrency Control

  9. Key Value Store

  10. Logging and Recovery

DB + ML

Data Exploration and Integration

Provenance and Explanation

  1. Hypothetical Reasoning via Provenance Abstraction. SIGMOD 2019. paper
  2. Ariadne: Online Provenance for Big Graph Analytics. SIGMOD 2019. paper
  3. Going Beyond Provenance: Explaining Query Answers with Pattern-based Counterbalances. SIGMOD 2019. paper
  4. Explaining Wrong Queries Using Small Examples. SIGMOD 2019. paper
  5. Answering Why-questions by Exemplars in Attributed Graphs. SIGMOD 2019. paper
  6. Explaining Query Answers with Explanation-Ready Databases. PVLDB 2015. paper
  7. A Formal Approach to Finding Explanations for Database Queries. SIGMOD 2014. paper
  8. Provenance for Aggregate Queries. PODS 2011. paper

Data Exploration

  1. Fast Approximation of Empirical Entropy via Subsampling. KDD 2019. paper
  2. Constraint-based Explanation and Repair of Filter-based Transformations. PVLDB 2018. paper
  3. DIFF: A Relational Interface for Large-Scale Data Explanation. PVLDB 2018. paper

Fairness

  1. Interventional Fairness: Causal Database Repair for Algorithmic Fairness. SIGMOD 2019. paper
  2. Operationalizing Individual Fairness with Pairwise Fair Representations. PVLDB 2019. paper
  3. Certifying and Removing Disparate Impact. KDD 2015. paper
  4. Learning Fair Representations. ICML 2013. paper
  5. Fairness Through Awareness. ITCS 2012. paper

Data Visualization

  1. Anti-Freeze for Large and Complex Spreadsheets: Asynchronous Formula Computation. SIGMOD 2019. paper
  2. Towards Democratizing Relational Data Visualization. SIGMOD 2019. paper
  3. Efficient Selection of Geospatial Data on Maps for Interactive and Visualized Exploration. SIGMOD 2018. paper
  4. Adaptive Sampling for Rapidly Matching Histograms. PVLDB 2018. paper

Graph Database

Shortest Path

  1. Scaling Distance Labeling on Small-World Networks. SIGMOD 2019. paper
  2. Efficiently Answering Regular Simple Path Queries on Large Labeled Networks. SIGMOD 2019. paper
  3. When Hierarchy Meets 2-Hop-Labeling: Efficient Shortest Distance Queries on Road Networks. SIGMOD 2018. paper

Influence Maximization

  1. Maximizing Welfare in Social Networks under a Utility Driven Influence Diffusion Model. SIGMOD 2019. paper
  2. Online Processing Algorithms for Influence Maximization. SIGMOD 2018. paper
  3. Finding Seeds and Relevant Tags Jointly: For Targeted Influence Maximization in Social Networks. SIGMOD 2018. paper
  4. Efficient Algorithms for Adaptive Influence Maximization. PVLDB 2018. paper
  5. Influence Maximization in Near-Linear Time: A Martingale Approach. SIGMOD 2015. paper
  6. Influence Maximization: Near-Optimal Time Complexity Meets Practical Efficiency. SIGMOD 2014. paper

Personalized PageRank

  1. TopPPR: Top-k Personalized PageRank Queries with Precision Guarantees on Large Graphs. SIGMOD 2018. paper
  2. FORA: Simple and Effective Approximate Single-Source Personalized PageRank. KDD 2017. paper
  3. HubPPR: Effective Indexing for Approximate Personalized PageRank. PVLDB 2016. paper
  4. Approximate Personalized PageRank on Dynamic Graphs. KDD 2016. paper
  5. Personalized PageRank Estimation and Search: A Bidirectional Approach. WSDM 2016. paper
  6. Local Computation of PageRank Contributions. WAW 2007. paper
  7. Local Graph Partitioning using PageRank Vectors. FOCS 2006. paper
  8. Scaling Personalized Web Search. WWW 2003. paper

Knowledge Graph

  1. Efficient Knowledge Graph Accuracy Evaluation. PVLDB 2019. paper
  2. Maverick: Discovering Exceptional Facts from Knowledge Graphs. SIGMOD 2018. paper
  3. Question Answering Over Knowledge Graphs: Question Understanding Via Template Decomposition. PVLDB 2018. paper

Graph Dependence

  1. Deducing Certain Fixes to Graphs. PVLDB 2019. paper
  2. Discovering Graph Functional Dependencies. SIGMOD 2018. paper
  3. Dependencies for Graphs. PODS 2017. paper

Query Processing and Optimization

Approximate Query Processing

  1. DBEst: Revisiting Approximate Query Processing Engines with Machine Learning Models. SIGMOD 2019. paper
  2. Deep Unsupervised Cardinality Estimation. PVLDB 2019. paper
  3. Learning to Sample: Counting with Complex Queries. PVLDB 2019. paper
  4. Random Sampling over Joins Revisited. SIGMOD 2018. paper
  5. AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics. SIGMOD 2018. paper
  6. Two-Level Sampling for Join Size Estimation. SIGMOD 2017. paper

Query Optimization

  1. Pessimistic Cardinality Estimation: Tighter Upper Bounds for Intermediate Join Cardinalities. SIGMOD 2019. paper
  2. Exact Cardinality Query Optimization with Bounded Execution Cost. SIGMOD 2019. paper
  3. Adaptive Optimization of Very Large Join Queries. SIGMOD 2018. paper

Skyline Query and Regret

  1. RRR: Rank-Regret Representative. SIGMOD 2019. paper
  2. Efficient k-Regret Query Algorithm with Restriction-free Bound for any Dimensionality. SIGMOD 2018. paper
  3. On Obtaining Stable Rankings. PVLDB 2018. paper

New Hardware

GPU/FPGA

NVM

  1. Managing Non-Volatile Memory in Database Systems. SIGMOD 2018. paper
  2. Let’s Talk About Storage & Recovery Methods for NVM DBMS. SIGMOD 2015. paper

SSD

Stream & Sketch

Security and Privacy

  1. Obladi: Oblivious Serializable Transactions in the Cloud. OSDI 2018. paper
  2. SMCQL: Secure Querying for Federated Databases. VLDB 2017. paper
  3. Shrinkwrap: Differentially-Private Query Processing in Private Data Federations. VLDB 2019. paper
  4. ObliDB: Oblivious Query Processing. VLDB 2020. paper
  5. Answering Multi-Dimensional Analytical Queries under Local Differential Privacy. SIGMOD 2019. paper
  6. Multi-Dimensional Data Sharing and Analytics as Services under Local Differential Privacy. VLDB 2019. paper

Transaction and Concurrency Control

  1. Serializable Snapshot Isolation in PostgreSQL. VLDB 2012. paper
  2. Staring into the Abyss: An Evaluation of Concurrency Control with One Thousand Cores. VLDB 2014. paper
  3. Rethinking Serializable Multiversion Concurrency Control. VLDB 2015. paper
  4. Mostly-Optimistic Concurrency Control for Highly Contended Dynamic Workloads on a Thousand Cores. VLDB 2016. paper
  5. Fast Databases with Fast Durability and Recovery Through Multicore Parallelism. OSDI 2014. paper
  6. Low Overhead Concurrency Control for Partitioned Main Memory Databases. SIGMOD 2010. paper

Key Value Store

Logging and Recovery