MaelKubli
Data scientist at the Department for Political Science at the University of Zurich. Fluent in R and HTML. Intermediate in SQL and Elastic.
University of ZurichZurich
MaelKubli's Stars
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
ggerganov/llama.cpp
LLM inference in C/C++
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
datvodinh/rag-chatbot
Chat with multiple PDFs locally
michelle123lam/lloom
Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-level concepts to analyze unstructured text.
thieled/meteoR
[WIP] Unofficial R wrapper for the Meteor API of the OPTED project. https://meteor.opted.eu/api/swagger
jxmorris12/language_tool_python
a free python grammar checker 📝✅
zumbov2/swissparl
The Swiss Parliament Webservices R API
hide-ous/pytangle
python wrapper for crowdtangle
saurabhprasun20/qualtrics-automation
saurabhprasun20/citizen_science_backend
dfreelon/pyktok
A simple module to collect video, text, and metadata from Tiktok.
pushshift/api
Pushshift API
hartator/wayback-machine-downloader
Download an entire website from the Wayback Machine.
JBGruber/traktok
The goal of traktok is to provide easy access to TikTok data.
arsena-k/discourse_atoms
How are topics encoded in semantic space? Repository to accompany PNAS article: https://www.pnas.org/doi/10.1073/pnas.2108801119
AllanCameron/geomtextpath
Create curved text paths in ggplot2
TropComplique/lda2vec-pytorch
Topic modeling with word vectors
maxent-ai/lda2vec
Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
ryanjgallagher/focalevents
Tools for collecting social media data around focal events
Heidelberg-NLP/xsrl_mbert_aligner
X-SRL Dataset. Including the code for the SRL annotation projection tool and an out-of-the-box word alignment tool based on Multilingual BERT embeddings.
relatio-nlp/relatio
code base for constructing narrative statements from text
zumbov2/deeplr
An R wrapper for the DeepL Translator API
CrowdTangle/API
API Documentation
chiphuyen/machine-learning-systems-design
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"
chiphuyen/python-is-cool
Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.
chiphuyen/stanford-tensorflow-tutorials
This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.
lvonlanthen/data-map-d3
A step-by-step tutorial to create a simple data map using D3.js