ashokurlana's Stars
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Facico/Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca
SinclairCoder/Instruction-Tuning-Papers
Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).
RenzeLou/awesome-instruction-learning
Papers and Datasets on Instruction Tuning and Following. ✨✨✨
rohithreddy024/Text-Summarizer-Pytorch
Pytorch implementation of "A Deep Reinforced Model for Abstractive Summarization" paper and pointer generator network
IamAdiSri/hf-trim
Reduce the size of pretrained Hugging Face models via vocabulary trimming.
bhaddow/pmindia-crawler
Code for extracting parallel corpora from pmindia
zsquaredz/adapt_vs_finetune
This repository contains code for paper "To Adapt or to Fine-tune: A Case Study on Abstractive Summarization" which appears in CCL 2022.
Pruthwik/Tokenizer_for_Indian_Languages
Tokenizer For Indian Languages
tingc9/Cross-Sum-News-Aligned
Continually updated repository housing cross lingual and mono lingual summarization data for different language pairs.
lokeshmadasu42/Mukhyansh
This repository contains Mukhyansh dataset and code base.
manshri/TeSum
pavanbaswani/rhetorical_roles
This repository is for rhetorical roles prediction conducted by the SemEval 2023
Pruthwik/Urdu-Tokenizer
Sentence and Word Tokenize Urdu Data