Pinned Repositories
AGENT-synthesis
Data synthesis code for "AGENT: A Benchmark for Core Psychological Reasoning"
Algorithmic_Problems
alpaca-rlhf
Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat
BioAsqQA_System
find_fallen_objects
Official implementation of CVPR 2022 paper "Finding Fallen Objects Via Asynchronous Audio-Visual Integration".
FishNet
Code for Kaggles "Nature Conservancy Fisheries" competition
Parallel-Computing-Class
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
UNCCQA
Code for Bioasq UNCC QA system
OPEn
abhi1092's Repositories
abhi1092/Parallel-Computing-Class
abhi1092/BioAsqQA_System
abhi1092/UNCCQA
Code for Bioasq UNCC QA system
abhi1092/AGENT-synthesis
Data synthesis code for "AGENT: A Benchmark for Core Psychological Reasoning"
abhi1092/Algorithmic_Problems
abhi1092/alpaca-rlhf
Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat
abhi1092/find_fallen_objects
Official implementation of CVPR 2022 paper "Finding Fallen Objects Via Asynchronous Audio-Visual Integration".
abhi1092/FishNet
Code for Kaggles "Nature Conservancy Fisheries" competition
abhi1092/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
abhi1092/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
abhi1092/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
abhi1092/coding_textbooks
abhi1092/Computer-Vision-and-Digital-Image-processing
abhi1092/crowdplay
CrowdPlay is a platform for crowdsourcing human demonstration trajectories in RL environments.
abhi1092/Deep-Learning-Vault
A collection of videos, blogs, slides and research paper related to deep learning and AGI
abhi1092/DeepSpeedExamples
Example models using DeepSpeed
abhi1092/EvalAI-Starters
How to create a challenge on EvalAI?
abhi1092/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
abhi1092/gym
A toolkit for developing and comparing reinforcement learning algorithms.
abhi1092/ibm-history-documents
Repository designed to host documents relating to IBM's glorious history
abhi1092/ProgrammingAssignment2
Repository for Programming Assignment 2 for R Programming on Coursera
abhi1092/python-flask-app
Start building your next Python Flask app on IBM Cloud.
abhi1092/qna_guideline
This is a temporary repository for working on Knowledge QNA guidelines
abhi1092/Random-Forest-RDD
Random Forest using RDD
abhi1092/RNN_langauge_model_jokes_corpus
abhi1092/sdg
Python library for Synthetic Data Generation
abhi1092/Sentiment_Analysis
abhi1092/taxonomy
Taxonomy tree that will allow you to create models tuned with your data
abhi1092/trl
Train transformer language models with reinforcement learning.
abhi1092/uncc-thesis-latex
This repository provides LaTeX class (.cls) and style (.sty) files that facilitate specifying Masters and Doctoral thesis documents that conform to the specifications of the UNC Charlotte graduate school.