ytsaig

ytsaig's Stars

xtekky/gpt4free
The official gpt4free repository | a collection of various powerful language models
Language: Python 62.9k 486 1.5k13.5k
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Language: Python 20.6k 260 722.6k
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Language: Python 9.8k 90 366752
THUDM/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Language: Python 7.7k 99 198608
alirezamika/autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Language: Python 6.5k 124 63677
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Language: Python 6.3k 52 1.7k773
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language: Python 4.8k 112 137419
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
Language: Python 4.1k 33 2.2k389
jina-ai/discoart
🪩 Create Disco Diffusion artworks in one line
Language: Python 3.8k 34 106249
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Language: Python 3.8k 30 391266
Nixtla/neuralforecast
Scalable and user-friendly neural 🧠 forecasting algorithms.
Language: Python 3.2k 36 568368
ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
Language: Python 3k 37 332374
fhamborg/news-please
news-please - an integrated web crawler and information extractor for news that just works
Language: Python 2.1k 54 180431
allenai/natural-instructions
Expanding natural instructions
Language: Python 964 21 161190
man-group/notebooker
Productionise & schedule your Jupyter Notebooks as easily as you wrote them.
Language: Python 862 24 6982
booknlp/booknlp
BookNLP, a natural language processing pipeline for books
Language: Python 810 23 25100
GEM-benchmark/NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
Language: Python 778 23 52196
princeton-nlp/DensePhrases
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.org/abs/2012.12624
Language: Python 606 13 3176
Nv7-GitHub/googlesearch
A Python library for scraping the Google search engine.
Language: Python 548 6 54122
oughtinc/ice
Interactive Composition Explorer: a debugger for compositional language model programs
Language: Python 537 10 3266
philschmid/easyllm
Language: Jupyter Notebook 439 8 2137
r-three/t-few
Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"
Language: Python 435 9 3361
YuanGongND/whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
Language: Python 344 11 3228
leondz/hatespeechdata
Catalog of abusive language data (PLoS 2020)
Language: Python 306 23 1176
JanPalasek/pretty-jupyter
Creates a dynamic HTML report from a Jupyter notebook.
Language: Python 298 6 5513
label-sleuth/label-sleuth
Open source no-code system for text annotation and building of text classifiers
Language: Python 253 5 13740
santhoshse7en/news-fetch
A Python package that helps scrape all news details from any news website
Language: Python 186 10 18108
ExpressAI/reStructured-Pretraining
reStructured Pre-training
97 7 310
BBN-E/ZS4IE
ZS4IE: A Toolkit for Zero-Shot Information Extraction with Simple Verbalizations
Language: Python 26 5 11
epfl-dlab/WikiHist.html
This is a repo containing all code and steps taken to download, set up the process, and convert the whole English Wikipedia history from Wikitext to HTML format.
Language: PHP 14 5 03