Pinned Repositories
aiflows
🤖🌊 aiFlows: The building blocks of your collaborative AI
cc_flows
The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI".
Cr5
Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"
GCD
GenIE
The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.
GoogleTrendsAnchorBank
Google Trends, made easy.
homepage2vec
Language-Agnostic Website Embedding and Classification
llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
SynthIE
The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction".
transformers-CFG
🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers
EPFL Data Science Lab (dlab)'s Repositories
epfl-dlab/quootstrap
Unsupervised method for extracting quotation-speaker pairs from large news corpora.
epfl-dlab/GraphCyclesRemoval
Implementation of "A fast and effective heuristic for the feedback arc set problem"
epfl-dlab/140_to_280
Repository for the paper “How Constraints Affect Content: The Case of Twitter's Switch from 140 to 280” published at ICWSM’18
epfl-dlab/structuring-wikipedia-articles
Structuring Wikipedia Articles with Section Recommendations
epfl-dlab/WCNPruning
A framework to clean the Wikipedia category network.
epfl-dlab/when_sheep_shop
Repository for the article "When Sheep Shop: Measuring Herding Effects in Product Ratings with Natural Experiments" published at WWW2018