suhara

@NVIDIASanta Clara, CA

suhara's Stars

megagonlabs/napa
🍷 Code for Noisy Pairing and Partial Supervision for Stylized Opinion Summarization (Iso et al; INLG 2024)
Language:Python2
abetlen/llama-cpp-python
Python bindings for llama.cpp
Language:Python8.3k1k
shizhediao/Post-Training-Data-Flywheel
We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.
Language:Python483
NVIDIA/RULER
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
Language:Python79453
llm-jp/awesome-japanese-llm
日本語LLMまとめ - Overview of Japanese LLMs
Language:TypeScript1.1k31
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda24.8k2.8k
shure-dev/Awesome-LLM-Papers-Comprehensive-Topics
Awesome LLM Papers and repos on very comprehensive topics.
19621
NVIDIA/NeMo-Curator
Scalable data pre processing and curation toolkit for LLMs
Language:Jupyter Notebook68791
ddhruvkr/CONTRADOC
Language:Python93
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python12.5k2.6k
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
Language:Python65179
daviddao/awful-ai
😈Awful AI is a curated list to track current scary usages of AI - hoping to raise awareness
7k233
hplt-project/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Language:Python48959
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
Language:Python2.2k192
megagonlabs/doduo
Annotating Columns with Pre-trained Language Models
Language:Python3110
gotutiyan/GEC-Info
Repository to collect and categorize Grammatical Error Correction papers.
11410
gentaiscool/indonesian-nlp
A curated list of research papers and resources on Indonesian languages
393
meetdavidwan/factpegasus
PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)
Language:Python381
mhagiwara/xfspell
xfspell — the Transformer Spell Checker
Language:Shell18822
abrazinskas/sigir2022-opinion-summarization-tutorial
This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.
341
megagonlabs/cocosum
:coconut: Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)
Language:Python212
ucinlp/autoprompt
AutoPrompt: Automatic Prompt Construction for Masked Language Models.
Language:Python59881
KalaniStanton/LSTMvBERT-NER
This repository contains multiple notebooks (created using Google Colab) that transform data from a doccano format for use in training a Bi-LSTM-CRF, fine-tuning a transformer using custom labels, and classifying using the fine-tuned bert-base-ner model
Language:Jupyter Notebook71
sfs0126/Lyric-Generator-fine-tuned-GPT-2
This project uses Huggingface transformers GPT-2 to fine-tune text generation models based on lyric data to specific music genres.
Language:Jupyter Notebook61
vivienneprince/bookcovers-ml-python
Judge a book by it's cover. Data from Open Library
Language:Jupyter Notebook1
tmccormack165/McCormack_Final_IMDb
Final project for Topics in Computing
Language:Jupyter Notebook1
lashleyaq/CellSegmentation
Language:Jupyter Notebook1
Chowlett2/Auto_Colorizer
A Convolutional Autoencoder for image colorization in Pytorch
Language:Jupyter Notebook1
sarahaman/CIS6930_TweetSum_Summarization
Performing abstractive summarization on dialogue-based texts poses several potential challenges to SOTA deep-learning techniques, which are tested primarily on single-author texts. I compare the performance of three SOTA pre-trained abstractive text summarization models on the TweetSum (He et al., 2020) dataset. Final project for CIS6390: Special Topics in Computing.
Language:Jupyter Notebook51
HHousen/TransformerSum
Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
Language:Python42958

suhara

suhara's Stars

megagonlabs/napa

abetlen/llama-cpp-python

shizhediao/Post-Training-Data-Flywheel

NVIDIA/RULER

llm-jp/awesome-japanese-llm

karpathy/llm.c

shure-dev/Awesome-LLM-Papers-Comprehensive-Topics

NVIDIA/NeMo-Curator

ddhruvkr/CONTRADOC

NVIDIA/NeMo

NVIDIA/NeMo-Aligner

daviddao/awful-ai

hplt-project/sacremoses

allenai/RL4LMs

megagonlabs/doduo

gotutiyan/GEC-Info

gentaiscool/indonesian-nlp

meetdavidwan/factpegasus

mhagiwara/xfspell

abrazinskas/sigir2022-opinion-summarization-tutorial

megagonlabs/cocosum

ucinlp/autoprompt

KalaniStanton/LSTMvBERT-NER

sfs0126/Lyric-Generator-fine-tuned-GPT-2

vivienneprince/bookcovers-ml-python

tmccormack165/McCormack_Final_IMDb

lashleyaq/CellSegmentation

Chowlett2/Auto_Colorizer

sarahaman/CIS6930_TweetSum_Summarization

HHousen/TransformerSum