huggingface-transformers

There are 967 repositories under huggingface-transformers topic.

  • christianversloot/machine-learning-articles

    🧠💬 Articles I wrote about machine learning, archived from MachineCurve.com.

  • yuanzhoulvpi2017/zero_nlp

    中文nlp解决方案(大模型、数据、模型、训练、推理)

    Language:Python2.5k30172328
  • katanaml/sparrow

    Data processing with ML and LLM

    Language:Python2.1k3450252
  • lxe/simple-llm-finetuner

    Simple UI for LLM Model Finetuning

    Language:Jupyter Notebook2k2048135
  • SocialEcho

    nz-m/SocialEcho

    Social networking platform with automated content moderation and context-based authentication system

    Language:JavaScript2k305462
  • amazon-science/chronos-forecasting

    Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting

    Language:Python1.9k2137222
  • refuel-ai/autolabel

    Label, clean and enrich text datasets with LLMs.

    Language:Python1.9k20233123
  • Tencent/TurboTransformers

    a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

    Language:C++1.4k41118193
  • uform

    unum-cloud/uform

    Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

    Language:Python921132455
  • unitaryai/detoxify

    Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.

    Language:Python8611561110
  • Denis2054/Transformers-for-NLP-2nd-Edition

    Transformer models from BERT to GPT-4, environments from Hugging Face to OpenAI. Fine-tuning, training, and prompt engineering examples. A bonus section with ChatGPT, GPT-3.5-turbo, GPT-4, and DALL-E including jump starting GPT-4, speech-to-text, text-to-speech, text to image generation with DALL-E, Google Cloud AI,HuggingGPT, and more

    Language:Jupyter Notebook686223266
  • OpenAdapt

    OpenAdaptAI/OpenAdapt

    AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

    Language:Python658837780
  • georgian-io/Multimodal-Toolkit

    Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

    Language:Python561255484
  • panaverse/learn-generative-ai

    Learn Cloud Applied Generative AI Engineering (GenEng) using OpenAI, Gemini, Streamlit, Containers, Serverless, Postgres, LangChain, Pinecone, and Next.js

    Language:Python553311193
  • huggingface/transformers-bloom-inference

    Fast Inference Solutions for BLOOM

    Language:Python5481264110
  • xlang-ai/UnifiedSKG

    [EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models

    Language:Python534113957
  • r2d4/rellm

    Exact structure out of any language model completion.

    Language:Python49211423
  • keytotext

    gagan3012/keytotext

    Keywords to Sentences

    Language:Jupyter Notebook439143660
  • Xirider/finetune-gpt2xl

    Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed

    Language:Python42152273
  • barissayil/SentimentAnalysis

    Sentiment analysis neural network trained by fine-tuning BERT, ALBERT, or DistilBERT on the Stanford Sentiment Treebank.

    Language:Python35412947
  • varunshenoy/super-json-mode

    Low latency JSON generation using LLMs ⚡️

    Language:Jupyter Notebook3492511
  • THU-KEG/OmniEvent

    A comprehensive, unified and modular event extraction toolkit.

    Language:Python323102529
  • rizerphe/local-llm-function-calling

    A tool for generating function arguments and choosing what function to call with local LLMs

    Language:Python28641328
  • geeks-of-data/knowledge-gpt

    Extract knowledge from all information sources using gpt and other language models. Index and make Q&A session with information sources.

    Language:Python27251151
  • krishnap25/mauve

    Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.

    Language:Python26341325
  • aws-samples/amazon-sagemaker-local-mode

    Amazon SageMaker Local Mode Examples

    Language:Python23272255
  • sudharsan13296/Getting-Started-with-Google-BERT

    Build and train state-of-the-art natural language processing models using BERT

    Language:Jupyter Notebook2108082
  • ASR-project/Multilingual-PR

    Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022) pretrained on a corpus of English speech that we will use in various ways to perform phoneme recognition for different languages with a network trained with Connectionist Temporal Classification (CTC) algorithm.

    Language:Python1784515
  • daspartho/prompt-extend

    extending stable diffusion prompts with suitable style cues using text generation

    Language:Jupyter Notebook175438
  • ikergarcia1996/Easy-Translate

    Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible for beginners and as seamlesscustomizable and as possible for advanced users.

    Language:Python16588258
  • quickai

    geekjr/quickai

    QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.

    Language:Python1628216
  • microsoft/monitors4codegen

    Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multispy` is a lsp client library in Python intended to be used to build applications around language servers.

    Language:Python1587519
  • cahya-wirawan/indonesian-language-models

    Indonesian Language Models and its Usage

    Language:Jupyter Notebook15013328
  • Victarry/stable-dreambooth

    Dreambooth implementation based on Stable Diffusion with minimal code.

    Language:Python14241121
  • lxuechen/private-transformers

    A codebase that makes differentially private training of transformers easy.

    Language:Python14052920
  • arihanv/Shush

    Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app

    Language:TypeScript1364621