Pinned Repositories
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
octopack
🐙 OctoPack: Instruction Tuning Code Large Language Models
lorahub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
regmix
🧬 RegMix: Data Mixture as Regression for Language Model Pre-training
sailor-llm
[EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia
scaling-with-vocab
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
BUAAOS-guide-book
北航小操作系统实验指导书
code-html-to-markdown
A lightweight script for processing HTML page to markdown format with support for code blocks
Graph-Neural-Network-Note
A blog for understanding graph neural network
Persona-Dialogue-Generation
The code of ACL 2020 paper "You Impress Me: Dialogue Generation via Mutual Persona Perception"
SivilTaram's Repositories
SivilTaram/Persona-Dialogue-Generation
The code of ACL 2020 paper "You Impress Me: Dialogue Generation via Mutual Persona Perception"
SivilTaram/code-html-to-markdown
A lightweight script for processing HTML page to markdown format with support for code blocks
SivilTaram/Calculator
阿超的四则运算生成器 v1.0
SivilTaram/Awesome-Prompt-Engineering
This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
SivilTaram/LM-reasoning
This repository contains a collection of papers and resources on Reasoning in Large Language Models.
SivilTaram/santacoder-finetuning-commit
Fine-tune SantaCoder for Code/Text Generation.
SivilTaram/SivilTaram.github.io
SivilTaram/axolotl
Go ahead and axolotl questions
SivilTaram/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
SivilTaram/bytepiece
更纯粹、更高压缩率的Tokenizer
SivilTaram/catwalk
This project studies the performance and robustness of language models and task-adaptation methods.
SivilTaram/commits
SivilTaram/dclm
DataComp for Language Models
SivilTaram/extract-expert
Extract a single expert from an MoE model of Mixtral architecture, using slerp
SivilTaram/GPT-classification-example
OpenAI gpt classification fine-tuning example.
SivilTaram/guidance
A guidance language for controlling large language models.
SivilTaram/infinigen
Infinite Photorealistic Worlds using Procedural Generation
SivilTaram/InstructionWild
SivilTaram/Megatron-LLM
distributed trainer for LLMs
SivilTaram/mergekit
Tools for merging pretrained large language models.
SivilTaram/oat
🌾 OAT: Online AlignmenT for LLMs
SivilTaram/OpenAgents
OpenAgents: An Open Platform for Language Agents in the Wild
SivilTaram/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
SivilTaram/primeqa
The prime repository for state-of-the-art Multilingual Question Answering research and development.
SivilTaram/sailcraft
Data Toolkit for Sailor Language Models
SivilTaram/SivilTaram.github.io.v1
personal online resume
SivilTaram/surya
Accurate line-level text detection and recognition (OCR) in any language
SivilTaram/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
SivilTaram/Triton-Puzzles
Puzzles for learning Triton
SivilTaram/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs