vikas95
PhD student in the Information and Computer Sciences department, University of Arizona, Tucson
University of Arizona, Tucson
vikas95's Stars
openai/CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
huggingface/open-r1
Fully open reproduction of DeepSeek-R1
Jiayi-Pan/TinyZero
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
langchain-ai/langgraph
Build resilient language agents as graphs.
twitter/the-algorithm-ml
Source code for Twitter's Recommendation Algorithm
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
fchollet/ARC-AGI
The Abstraction and Reasoning Corpus
andrewekhalel/MLQuestions
Machine Learning and Computer Vision Engineer - Technical Interview Questions
NovaSky-AI/SkyThought
Sky-T1: Train your own O1 preview model within $450
mlabonne/llm-datasets
Curated list of datasets and tools for post-training.
openai/human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
codelion/optillm
Optimizing inference proxy for LLMs
castorini/pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
salesforce/WikiSQL
A large annotated semantic parsing corpus for developing natural language interfaces.
facebookresearch/multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
openai/grade-school-math
Shark-NLP/DiffuSeq
[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
jxzhangjhu/Awesome-LLM-Uncertainty-Reliability-Robustness
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
teacherpeterpan/Question-Generation-Paper-List
A summary of must-read papers for Neural Question Generation (NQG)
czyssrs/FinQA
Data and code for EMNLP 2021 paper "FinQA: A Dataset of Numerical Reasoning over Financial Data"
Sahandfer/EMPaper
This is a repository for sharing papers in the field of empathetic conversational AI. The related source code for each paper is linked if available.
shauryr/ACL-anthology-corpus
This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs
Sahandfer/PersonaPaper
This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linked if available.
FudanSELab/ClassEval
Benchmark ClassEval for class-level code generation.
amikelive/coco-labels
The labels for object categories in COCO dataset
GasolSun36/Iter-CoT
[NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models
prdwb/orconvqa-release
anuradha1992/EDOS
A Large-Scale Dataset for Empathetic Response Generation
AIM3-RUC/MPMQA
Official repository of the paper MPMQA: Multimodal Question Answering on Product Manuals (AAAI 2023)