yyht

Pinned Repositories

adaptive-quantization-modules
Code for "Online Learned Continual Compression with Adaptive Quantization Modules"
Language: Python 1 1 00
BERT
Modification of the official BERT for downstream tasks
Language: Python 31 3 611
cail22_xxcq
Language: Python 1 1 01
classifynet
Text classification with various advanced modules and recent models such as LEAM, hard attention, and multi-head attention
Language: Python 9 2 14
Conv_Bert
Language: Python 1 2 01
crust
[NeurIPS 2020] Coresets for Robust Training of Neural Networks against Noisy Labels
Language: Python 1 1 01
DeepNER
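Champion solution for the Tianchi Traditional Chinese Medicine instruction-leaflet entity recognition challenge; Chinese named entity recognition; NER; BERT-CRF & BERT-SPAN & BERT-MRC; PyTorch
Language: Python 5 2 02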
deepspeech
DeepSpeech on TensorFlow (1.x) with support for TPU and GPU
Language: Shell 2 1 00
simnet
Semantic similarity model
Language: Python 7 3 07
UPSA
Language: Python 2 1 011

yyht's Repositories

yyht/Agent-Pro
The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
Language: Python 0 0
yyht/AI-Security-and-Privacy-Events
A curated list of academic events on AI Security & Privacy
0 0
yyht/ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
yyht/autogen-agi
AutoGen AGI: Advancing AI agents using AutoGen towards AGI capabilities. Explore cutting-edge enhancements in group chat dynamics, decision-making, and complex task proficiency. Join our journey in shaping AI's future!
Language: Python 0 0
yyht/Awesome-LLM-Safety
A curated list of security-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the security implications, challenges, and advancements surrounding these powerful models.
0 0
yyht/deep_training
Deep learning
Language: Python 0 0
yyht/DeepSeek-Math
Language: Python 0 0
yyht/detect-secrets
An enterprise-friendly way of detecting and preventing secrets in code.
Language: Python 0 0
yyht/evalplus
Rigorous evaluation of LLM-synthesized code - NeurIPS 2023
Language: Python 0 0
yyht/GuoFeng-Webnovel
Multilingual Corpus of Web Fiction
0 0
yyht/HALOs
A library with extensible implementations of DPO, KTO, PPO, and other human-aware loss functions (HALOs).
yyht/HybridAGI
The Programmable Neuro-Symbolic AGI that lets you program its behavior using Graph-based Prompt Programming: for people who want AI to behave as expected
yyht/ICDPO
The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization".
Language: Python 0 0
yyht/k-diffusion-inverse-problems
Implementation of diffusion-based posterior sampling methods for inverse problems
Language: Python 0 0
yyht/lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Language: Python 0 0
yyht/llm-verified-with-monte-carlo-tree-search
LLM verified with Monte Carlo Tree Search
Language: Python 0 0
yyht/marker
Convert PDF to markdown quickly with high accuracy
Language: Python 0 0
yyht/Noise-Contrastive-Alignment
Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards"
yyht/OPO
Language: Python 0 0
yyht/orpo
Language: Python 0 0
yyht/REBEL
Language: Python 0 0
yyht/reflection-on-trees
Language: Python 0 0
yyht/ReNeLLM
The official implementation of our NAACL 2024 paper "A Wolf in Sheep’s Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily".
yyht/SAELens
Training Sparse Autoencoders on Language Models
Language: HTML 0 0
yyht/searchformer
Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping".
Language: Jupyter Notebook 0 0
yyht/self-rewarding-lm-pytorch
Implementation of the training framework proposed in "Self-Rewarding Language Models", from Meta AI
Language: Python 0 0
yyht/SemiReward
[ICLR 2024] SemiReward: A General Reward Model for Semi-supervised Learning
yyht/theorem-proving-reasoning
Code for the paper "LeanReasoner: Boosting Complex Logical Reasoning with Lean": https://arxiv.org/pdf/2403.13312.pdf
yyht/unisim
UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.
yyht/URS
URS Benchmark: Evaluating LLMs on User Reported Scenarios
Language: Python 0 0