Bailefan's Stars
MinishLab/semhash
Fast Semantic Text Deduplication
henrythe9th/AI-Crash-Course
AI Crash Course to help busy builders catch up to the public frontier of AI research in 2 weeks
merveenoyan/smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
NirDiamant/RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
llmgenai/LLMInterviewQuestions
This repository contains LLM (Large language model) interview question asked in top companies like Google, Nvidia , Meta , Microsoft & fortune 500 companies.
Bailefan/Nano-ESG
A Dataset about Corporate Sustainability Information in News Articles
microsoft/markitdown
Python tool for converting files and office documents to Markdown.
aishwaryanr/awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
huggingface/smol-course
A course on aligning smol models.
cyclotruc/gitingest
Replace 'hub' with 'ingest' in any github url to get a prompt-friendly extract of a codebase
Delgan/loguru
Python logging made (stupidly) simple
MinishLab/vicinity
Lightweight Nearest Neighbors with Flexible Backends
gmberton/awesome-machine-learning-startups
List of startups doing AI & ML
andrewyng/aisuite
Simple, unified interface to multiple Generative AI providers
fmind/mlops-python-package
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
explosion/spacy-layout
📚 Process PDFs, Word documents and more with spaCy
browser-use/browser-use
Make websites accessible for AI agents
marimo-team/marimo
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
dynamiq-ai/dynamiq
Dynamiq is an orchestration framework for agentic AI and LLM applications
i-am-bee/bee-agent-framework
Framework for building scalable agentic applications.
HandsOnLLM/Hands-On-Large-Language-Models
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
chonkie-ai/chonkie
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
huggingface/smollm
Everything about the SmolLM2 and SmolVLM family of models
fishaudio/fish-speech
SOTA Open Source TTS
DS4SD/docling
Get your documents ready for gen AI
niderhoff/knowledge-repository
knowledge repository with learning resources, examples, links for various data science / computer science topics
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
PacktPublishing/LLM-Engineers-Handbook
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices