alphadl
NLP/ML researcher (developing GenAI and its human-centric applications).
JD Explore Academy, JD.com Inc.Shanghai(CN) & Sydney(AU)
Pinned Repositories
darts.pytorch1.1
Implementation with latest PyTorch (v1.1) for multi-gpu DARTS https://arxiv.org/abs/1806.09055
lookahead.pytorch
lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch
Neural-Machine-Translation
Several basic neural machine translation models implemented by PyTorch & TensorFlow
OOP-eval
The first Object-Oriented Programming (OOP) Evaluaion Benchmark for LLMs
proxifier_code
several free registration code for PROXIFIER
R1
🚀enhanced GRPO with more verifiable rewards and real-time evaluators
ErrorAnalysis_Prompt
:gift:[ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPT
ChatGPT4MT
🎁[ChatGPT4MT] Towards Making the Most of ChatGPT for Machine Translation
MT-Reading-List
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
ChatGPT-vs.-BERT
🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT
alphadl's Repositories
alphadl/proxifier_code
several free registration code for PROXIFIER
alphadl/R1
🚀enhanced GRPO with more verifiable rewards and real-time evaluators
alphadl/OOP-eval
The first Object-Oriented Programming (OOP) Evaluaion Benchmark for LLMs
alphadl/SafeLLM_with_IntentionAnalysis
Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting
alphadl/CodeGen-USCD
Code Gen with "Uncertainty Aware Selective Contrastive Decoding"
alphadl/LanguageAware_Tuning
langauge-aware tuning for building accurate cross-lingual LLMs
alphadl/SafeVLM_with_AMIA
Towards Safe LVM with Automatic Masking and Joint Intention Analysis
alphadl/AlignLLMHumanSurvey
Aligning Large Language Models with Human: A Survey
alphadl/Awesome-Knowledge-Distillation-of-LLMs
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". It includes KD Algorithms (Labeling, Self-Knowledge, Self-Rewarding, SFT, etc.) leveraging data augmentation or synthesis for getting knowledge, Skill Distillation (e.g. instruction following), and specialized applications (law, healthcare).
alphadl/Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
alphadl/deep_learning_curriculum
Language model alignment-focused deep learning curriculum
alphadl/Efficient-LLMs-Survey
Efficient Large Language Models: A Survey
alphadl/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
alphadl/agents
An Open-source Framework for Autonomous Language Agents
alphadl/AGI-survey
alphadl/alphadl
statistics
alphadl/Awesome-LLM-Safety
A curated list of security-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the security implications, challenges, and advancements surrounding these powerful models.
alphadl/FireAct
FireAct: Toward Language Agent Fine-tuning
alphadl/ICD
[ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding
alphadl/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
alphadl/LLM-Lite
clean LLM train/inference code
alphadl/LLMPapers
Papers & Works for large languange models (ChatGPT, GPT-3, Codex etc.).
alphadl/PromptBias
alphadl/RAM
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
alphadl/RLFromScratch
alphadl/rllm
Democratizing Reinforcement Learning for LLMs
alphadl/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
alphadl/Step-Audio
alphadl/Unified-MoE-Compression
The official implementation of the paper "Demystifying the Compression of Mixture-of-Experts Through a Unified Framework".
alphadl/X-R1
minimal-cost for training 0.5B R1-Zero