Pinned Repositories
medal
Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain
weblinx
WebLINX is a benchmark for building web navigation agents with conversational capabilities
webllama
Llama-3 agents that can browse the web by following instructions and talking to you
bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
covid-qa
A collection of COVID-19 question-answer pairs and transformer baselines for evaluating QA models (Official Repository)
dash-draggable
react-draggable in Python
dl-translate
Library for translating between 200 languages. Built on 🤗 transformers.
keras-noisy-student
EfficientNet-L2 weights in Keras and retrieval script modified from qubvel/efficientnet
keras-toolkit
A collection of functions to help you easily train and run Tensorflow Keras. It includes 1-line auto-TPU support, GPU memory management, and tf.data builders.
react-pyodide-template
A simple template to get started with pyodide inside React
xhluca's Repositories
xhluca/bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
xhluca/dl-translate
Library for translating between 200 languages. Built on 🤗 transformers.
xhluca/awesome-ml-visualization
Curated list of awesome ML Visualization Libraries
xhluca/llama-2-local-ui
Chat UI for locally-hosted LLaMA-2
xhluca/bm25-benchmarks
xhluca/simple-pubsub
A simple repository that implements a redis-style pub/sub with pure Python
xhluca/latex-vscode-template
A template repository for latex in vscode (via Latex Workshop), with GPT-written instructions on setting it up
xhluca/arxiv-assistant
A simple webapp for helping you navigate Arxiv.org
xhluca/BrowserGym
BrowserGym, a gym environment for web task automation in the Chromium browser.
xhluca/qr-code
A basic, free, ad-less, PWA-ready, open-source QR Code generator
xhluca/wikicat
Toolkit for managing and navigating graphs of Wikipedia categories
xhluca/browsergym-webarena-agent
xhluca/pyodide-blog
The Pyodide blog
xhluca/aclpubcheck
Tools for checking ACL paper submissions
xhluca/AgentLab
xhluca/blog
Public repo for HF blog posts
xhluca/browsergym-simple-agent
A simple agent in browsergym that will execute your actions verbatim
xhluca/calendar
testing
xhluca/cc-academic-publishers
[Work in progress] List of Academic Publishers with a creative commons license (e.g. CC-BY)
xhluca/cc-websites
Websites where the webpage is licensed under CC
xhluca/CSrankings
A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.
xhluca/merge-conflict-demo
xhluca/mini_wiki
xhluca/mteb
MTEB: Massive Text Embedding Benchmark
xhluca/snowball_stemmer_wheels
A fork of PyStemmer with pure wheels
xhluca/stark
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases (NeurIPS D&B 2024)
xhluca/test-jekyll
xhluca/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
xhluca/webarena-setup
Setup scripts for the WebArena benchmark
xhluca/webllama.github.io