KaiQiangSong
Senior Research Scientist @ Tencent AI Lab. Interested in NLP, LLMs, text generation, and summarization. Hiring interns.
Tencent AI Lab, Bellevue, WA
KaiQiangSong's Stars
xai-org/grok-1
Grok open release
openai/openai-python
The official Python library for the OpenAI API
BuilderIO/gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
huggingface/text-generation-inference
Large Language Model Text Generation Inference
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
breezedeus/Pix2Text
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
databricks/megablocks
andyzoujm/representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
huggingface/transformers-bloom-inference
Fast Inference Solutions for BLOOM
epfLLM/Megatron-LLM
distributed trainer for LLMs
GAIR-NLP/MathPile
Generative AI for Math: MathPile
OpenLMLab/LEval
[ACL'24] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
Zaki-1052/GPTPortal
A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight Node.js web app; supports customizable multimodality for voice, images, & files.
ProjectD-AI/LLaMA-Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
FuxiaoLiu/MMC
[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning
qinyiwei/InfoBench
YebowenHu/MeetingBank-utils
tencent-ailab/zebra-inference
tencent-ailab/FOLNet
This repository includes the code for First-Order Logic Network (FOLNet).
rail-cwru/rankboostplus
Implementations of various ranking-by-boosting algorithms including Rankboost+.
KaiQiangSong/InfoBench
sangwoo3/lit-GPT
tjruwase/transformers-bloom-inference
Fast Inference Solutions for BLOOM