mihara-bot's Stars
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
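The core operation faiss accelerates is nearest-neighbor search over dense vectors. As a point of reference, brute-force k-nearest-neighbor search under L2 distance can be sketched in plain NumPy (illustrative only — this is not faiss's API, which provides indexes like flat, IVF, and HNSW with far better scaling):

```python
import numpy as np

def l2_search(database: np.ndarray, queries: np.ndarray, k: int):
    """Brute-force k-nearest-neighbor search under squared L2 distance.

    database: (n, d) array of stored vectors
    queries:  (m, d) array of query vectors
    Returns (distances, indices), each of shape (m, k).
    """
    # Pairwise squared distances via ||a - b||^2 = ||a||^2 - 2 a.b + ||b||^2
    d2 = (
        (queries ** 2).sum(axis=1, keepdims=True)
        - 2.0 * queries @ database.T
        + (database ** 2).sum(axis=1)
    )
    idx = np.argsort(d2, axis=1)[:, :k]  # indices of the k closest database vectors
    return np.take_along_axis(d2, idx, axis=1), idx

rng = np.random.default_rng(0)
xb = rng.standard_normal((1000, 64)).astype("float32")  # database
xq = xb[:5] + 1e-6                                      # queries near known vectors
dist, ind = l2_search(xb, xq, k=3)
```

Libraries like faiss exist because this O(n·m·d) scan becomes the bottleneck at scale; they trade a little recall for large speedups via quantization and graph- or cluster-based indexes.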
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
meta-llama/llama3
The official Meta Llama 3 GitHub site
THUDM/GLM-4
GLM-4 series: open multilingual, multimodal chat LMs
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
open-compass/opencompass
OpenCompass is an LLM evaluation platform supporting a wide range of models (Llama 3, Mistral, InternLM2, GPT-4, Llama 2, Qwen, GLM, Claude, etc.) over 100+ datasets.
attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
huggingface/lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
conversationai/perspectiveapi
Perspective is an API that uses machine learning models to score the perceived impact a comment might have on a conversation. See https://developers.perspectiveapi.com for more information.
CarperAI/OpenELM
Evolution Through Large Models
Shark-NLP/OpenICL
OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.
hkust-nlp/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR 2024]
subhadarship/kmeans_pytorch
kmeans using PyTorch
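The algorithm this repo ports to GPU tensors is standard Lloyd's k-means: alternate assigning points to their nearest center with recomputing each center as the mean of its cluster. A minimal NumPy sketch of that loop (illustrative, not the kmeans_pytorch API):

```python
import numpy as np

def kmeans(x: np.ndarray, k: int, iters: int = 50, seed: int = 0):
    """Plain Lloyd's algorithm on a (n, d) array; returns (labels, centers)."""
    rng = np.random.default_rng(seed)
    centers = x[rng.choice(len(x), size=k, replace=False)].copy()  # init from data
    for _ in range(iters):
        # assignment step: nearest center for every point
        d2 = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
        labels = d2.argmin(axis=1)
        # update step: move each center to the mean of its assigned points
        for j in range(k):
            pts = x[labels == j]
            if len(pts):
                centers[j] = pts.mean(axis=0)
    return labels, centers

# two well-separated blobs: k-means should recover them exactly
rng = np.random.default_rng(1)
x = np.concatenate([rng.normal(0, 0.1, (50, 2)), rng.normal(5, 0.1, (50, 2))])
labels, centers = kmeans(x, k=2)
```

The PyTorch version follows the same structure but runs the distance and reduction steps as batched tensor ops on the GPU.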
princeton-nlp/LESS
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
NUS-HPC-AI-Lab/InfoBatch
Lossless Training Speed Up by Unbiased Dynamic Data Pruning
sangmichaelxie/doremi
PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
eth-sri/language-model-arithmetic
Controlled Text Generation via Language Model Arithmetic
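The idea behind language-model arithmetic is to compose the next-token distributions of several models as a weighted combination in log space, so a negative weight steers generation away from what one model prefers. A toy NumPy sketch with a made-up three-word vocabulary (the distributions and weights here are invented for illustration, not taken from the paper's experiments):

```python
import numpy as np

def softmax(z):
    z = z - z.max()  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def combine(logprob_sets, weights):
    """Weighted sum of per-model log-probabilities, renormalized over the vocab.

    logprob_sets: list of (vocab,) arrays of log p_i(token | context)
    weights: per-model coefficients; a negative weight steers *away* from that model
    """
    combined = sum(w * lp for w, lp in zip(weights, logprob_sets))
    return softmax(combined)

vocab = ["good", "bad", "okay"]
lp_base  = np.log(np.array([0.5, 0.3, 0.2]))   # base model's next-token distribution
lp_attr  = np.log(np.array([0.1, 0.8, 0.1]))   # model scoring an unwanted attribute
# subtracting half of the attribute model's log-probs suppresses "bad"
p = combine([lp_base, lp_attr], weights=[1.0, -0.5])
```

In the actual method this combination is applied at every decoding step over the full vocabulary, which is what makes the composition controllable without retraining any model.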
princeton-nlp/QuRating
[ICML 2024] Selecting High-Quality Data for Training Language Models
SALT-NLP/demonstrated-feedback
ahalterman/mordecai3
Full text geoparsing/toponym resolution with event geolocation
locuslab/scaling_laws_data_filtering
skzhang1/IDEAL
IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models
adymaharana/d2pruning
cohere-ai/human-feedback-paper
Code and data from the paper 'Human Feedback is not Gold Standard'
lucy3/whos_filtered
daeveraert/gradient-information-optimization
Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection
xlhex/acl2024_xicl