jeswan's Stars
puppeteer/puppeteer
JavaScript API for Chrome and Firefox
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Ebazhanov/linkedin-skill-assessments-quizzes
Full reference of LinkedIn answers 2024 for skill assessments (aws-lambda, rest-api, javascript, react, git, html, jquery, mongodb, java, Go, python, machine-learning, power-point) linkedin excel test lösungen, linkedin machine learning test LinkedIn test questions and answers
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—foundation models
pgvector/pgvector
Open-source vector similarity search for Postgres
google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
microsoft/BioGPT
whitead/paper-qa
LLM Chain for answering questions from documents with citations
pyppeteer/pyppeteer
Headless chrome/chromium automation library (unofficial port of puppeteer)
miyakogi/pyppeteer
Headless chrome/chromium automation library (unofficial port of puppeteer)
juncongmoo/pyllama
LLaMA: Open and Efficient Foundation Language Models
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
sfikas/medical-imaging-datasets
A list of Medical imaging datasets.
tairov/llama2.mojo
Inference Llama 2 in one file of pure 🔥
castorini/pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
pytorch/data
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
mosaicml/streaming
A Data Streaming Library for Efficient Neural Network Training
KinWaiCheuk/nnAudio
Audio processing by using pytorch 1D convolution network
bloomberg/pystack
🔍 🐍 Like pstack but for Python!
scrapinghub/extruct
Extract embedded metadata from HTML markup
miso-belica/jusText
Heuristic based boilerplate removal tool
glample/fastBPE
Fast BPE
titipata/pubmed_parser
:clipboard: A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset
minzwon/sota-music-tagging-models
mir-dataset-loaders/mirdata
Python library for working with Music Information Retrieval datasets
Spijkervet/torchaudio-augmentations
Audio transformations library for PyTorch
hstojic/ChapelleLi_2011_replication
Replication of simulations and results from Chapelle, O., & Li, L. (2011). An empirical evaluation of Thompson sampling. Advances in neural information processing systems, 2249-2257.