language-models

There are 267 repositories under the language-models topic.

  • Awesome-NLP-Resources

    This repository contains landmark research papers in Natural Language Processing that came out in this century.

  • JAXSeq

    Train very large language models in JAX.

    Language: Python · 209
  • tldr-transformers

    The "tl;dr" on a few notable transformer papers (pre-2022).

  • LangChain-Chat-with-Your-Data

    Explore LangChain and build powerful chatbots that interact with your own data. Gain insights into document loading, splitting, retrieval, question answering, and more.

    Language: Jupyter Notebook · 183
  • pretraining-with-human-feedback

    Code accompanying the paper "Pretraining Language Models with Human Preferences"

    Language: Python · 180
  • language-models-are-knowledge-graphs-pytorch

    Language Models are Open Knowledge Graphs (unofficial implementation)

    Language: Python · 169
  • next-token-prediction

    Next-token prediction in JavaScript — build fast language and diffusion models.

    Language: JavaScript · 143
  • Confucius-tool-learning

    Official Repo for AAAI 2024 paper "Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum"

    Language: Python · 140
  • DART

    [ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners

    Language: Python · 131
  • transformer-ranker

    Efficiently find the best-suited language model (LM) for your NLP task

    Language: Python · 127
  • MSR-NLP-Projects

    A list of open-source projects from the Microsoft Research NLP Group

  • marbert

    UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic

  • nlp-papers

    Must-read papers on Natural Language Processing (NLP)

  • EMNLP-2023-Papers

    EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning, deep learning, and natural language processing, with code included. ⭐ Support NLP!

    Language: Python · 109
  • quanta-quest

    AI-powered universal search for all your personal data, tailored just for you. Goal: the world's first product with "edge-side LLMs + consumer data localization" as its core development direction.

    Language: TypeScript · 104
  • ROSGPT_Vision

    Commanding robots using only Language Models' prompts

    Language: Python · 102
  • Hugging-Captions

    Generate realistic Instagram captions using transformers 🤗

    Language: Python · 101
  • bert-in-production

    A collection of resources on using BERT (https://arxiv.org/abs/1810.04805) and related language models in production environments.

  • nlg-metricverse

    [COLING22] An End-to-End Library for Evaluating Natural Language Generation

    Language: Python · 92
  • ngram-language-model

    Python implementation of an N-gram language model with Laplace smoothing and sentence generation.

    Language: Python · 87
  • InfoBERT

    [ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Yu Cheng, Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu

    Language: Python · 85
  • turkish-language-models

    Transformer-based Turkish language models

    Language: Python · 78
  • disco

    A Toolkit for Distributional Control of Generative Models

    Language: Python · 73
  • cybertron-ai

    MindSpore implementation of transformers

    Language: Python · 69
  • llm-datasets

    A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling.

    Language: Python · 61
  • Building-Systems-with-the-ChatGPT-API

    In Building Systems With The ChatGPT API, you will learn how to automate complex workflows using chain calls to a large language model.

    Language: Jupyter Notebook · 59
  • Stanford-CS224n-NLP

    Course notes for Stanford CS224n: Natural Language Processing with Deep Learning, Winter 2019 (using PyTorch)

    Language: JavaScript · 51
  • llama-multimodal-vqa

    Multimodal Instruction Tuning for Llama 3

    Language: Python · 50
  • language-pretraining

    Pre-training Language Models for Japanese

    Language: Python · 50
  • TabSTAR

    TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations

    Language: Python · 49
  • CaLM

    Protein language model trained on coding DNA

    Language: Python · 49
  • Awesome-SciLM

    Pre-trained Language Model for Scientific Text

  • alm

    Smart Language Model

    Language: C++ · 46
  • language-models

    Keras implementations of three language models: character-level RNN, word-level RNN, and Sentence VAE (Bowman, Vilnis et al., 2016).

    Language: Python · 43
  • llm-math-education

    Retrieval-augmented generation for middle-school math question answering and hint generation.

    Language: Jupyter Notebook · 42
  • tree-prompt

    Tree prompting: easy-to-use scikit-learn interface for improved prompting.

    Language: Jupyter Notebook · 41