reza-putra's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
sebastianruder/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
EthicalML/awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
gunthercox/ChatterBot
ChatterBot is a machine learning, conversational dialog engine for creating chat bots
h2oai/h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
doccano/doccano
Open source annotation tool for machine learning practitioners.
google/trax
Trax — Deep Learning with Clear Code and Speed
alirezamika/autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
princeton-nlp/tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
makcedward/nlpaug
Data augmentation for NLP
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
deep-diver/LLM-As-Chatbot
LLM as a Chatbot Service
run-llama/sec-insights
A real world full-stack application using LlamaIndex
cbailes/awesome-deep-trading
List of awesome resources for machine learning-based algorithmic trading
imthaghost/goclone
Website Cloner - Utilizes powerful Go routines to clone websites to your computer within seconds.
PolyAI-LDN/conversational-datasets
Large datasets for conversational AI
makcedward/nlp
:memo: This repository recorded my NLP journey.
midas-research/audino
Open source audio annotation tool for humans
Liuhong99/Sophia
The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
IndoNLP/indonlu
The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)
facebookresearch/atlas
Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03299)
IndoNLP/nusa-crowd
A collaborative project to collect datasets in Indonesian languages.
samueldobbie/markup
A web-based document annotation tool, powered by GPT-4 :rocket:
dale3h/alexa-skills-list
A complete list of all available Alexa Skills
clinc/oos-eval
Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)
JetRunner/SuperICL
Code for "Small Models are Valuable Plug-ins for Large Language Models"