trifle's Stars
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Const-me/Whisper
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
alufers/mitmproxy2swagger
Automagically reverse-engineer REST APIs via capturing traffic
pufferffish/wireproxy
Wireguard client that exposes itself as a socks5 proxy
rspeer/python-ftfy
Fixes mojibake and other glitches in Unicode text, after the fact.
MaartenGr/KeyBERT
Minimal keyword extraction with BERT
qwj/python-proxy
HTTP/HTTP2/HTTP3/Socks4/Socks5/Shadowsocks/ShadowsocksR/SSH/Redirect/Pf TCP/UDP asynchronous tunnel proxy implemented in Python 3 asyncio.
aramperes/onetun
User space WireGuard port-forward in Rust
MaartenGr/PolyFuzz
Fuzzy string matching, grouping, and evaluation.
kjhealy/pandoc-templates
Some templates for Pandoc.
bellingcat/auto-archiver
Automatically archive links to videos, images, and social media content from Google Sheets (and more).
Florents-Tselai/WarcDB
WarcDB: Web crawl data as SQLite databases.
IQTLabs/SkyScan
Automatically photograph planes that fly by!
internetarchive/dweb-mirror
Offline Internet Archive project
AliasIO/demodal
Demodal is a browser extension that automatically removes content blocking modals including paywalls, discount offers, promts to sign up or enter your email address and more.
MaartenGr/Concept
Concept Modeling: Topic Modeling on Images and Text
fabiogiglietto/CooRnet
Given a set of URLs, this packages detects coordinated link sharing behavior on social media and outputs the network of entities that performed such behaviour.
eth-easl/cachew
ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).
fccsdigitalchina/digital-china
A curated list of digital things related to the field of Chinese studies.
citizenlab/tiktok-report-data
nail1021734/Taiwan_news_dataset
abitter/PTOS
Practices and Tools of Open Science: Topic Modeling
qut-dmrc/Crowd
trifle/face
Fully automated face extraction and age/gender prediction for images and videos
gbrindisi/applebooks
A tiny python library to interact with Apple Books databases
abitter/psychtopics
PsychTopics – A user-friendly app for exploring and analyzing research topics in psychology
IKMLab/Taiwan_news_dataset