mihirp1998

researcher at CMU MLD

Pittsburgh

Pinned Repositories

trl
Train transformer language models with reinforcement learning.
Language:Python9.3k 73 1.1k1.2k
Adversarial-Inverse-Graphics-Networks-for-Faces
Language:Python4 1 01
AlignProp
AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample- and compute-efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion.
Language:Python228 6 137
Diffusion-TTA
Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.
Language:Python49 4 104
Disentangling-3D-Prototypical-Nets
We present neural architectures that disentangle RGB-D images into objects' shapes and styles and a map of the background scene, and explore their applications for few-shot 3D object detection and few-shot concept classification.
Language:Python11 2 10
EmbLang
Embodied Language Grounding With 3D Visual Feature Representations
Language:Python4 1 11
Navigation-Deep-RL
This is a repo for navigating a Unity environment using a deep Q-learning network in PyTorch.
Language:Jupyter Notebook2 1 01
ProbabilisticNeuralProgrammedNetwork_Tensorflow
Code for "Probabilistic Neural Programmed Networks for Scene Generation", Deng et al., NIPS 2018. This codebase is a TensorFlow 2.0 port of the official PyTorch implementation.
Language:Python8 1 03
Slot-TTA
Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.
Language:Python23 3 23
VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models, such as VideoCrafter, OpenSora, ModelScope, and StableVideoDiffusion, by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics, etc.
Language:Python195 7 1415

mihirp1998's Repositories

mihirp1998/Navigation-Deep-RL
This is a repo for navigating a Unity environment using a deep Q-learning network in PyTorch.
Language:Jupyter Notebook2 1 01
mihirp1998/flask-webcam-recorder
A client-server repository for collecting video from the user and storing it on the server side
Language:JavaScript1 1 00
mihirp1998/Association-Net
This is an unofficial implementation of the paper "Learning Feature Hierarchies from Long-Range Temporal Associations in Videos" by Panna Felsen, Katerina Fragkiadaki, Jitendra Malik, and Alexei Efros
Language:Jupyter Notebook1 0
mihirp1998/continuous-control
Udacity deep reinforcement learning continuous control project
Language:Jupyter Notebook1 0
mihirp1998/Continuous-control-using-Policy-Based-RL-Method
Language:Jupyter Notebook1 01
mihirp1998/convHypernetComp_xav
Language:Python1 0
mihirp1998/cs224d
Code for Stanford CS224D: deep learning for natural language understanding
Language:Python2 0
mihirp1998/deep-reinforcement-learning
Repo for the Deep Reinforcement Learning Nanodegree program
Language:Jupyter Notebook1 0
mihirp1998/FaceCropper
Language:Python1 0
mihirp1998/FB-Bot
FB bot for home automation made using Node.js
Language:JavaScript2 0
mihirp1998/generating-reviews-discovering-sentiment
Code for "Learning to Generate Reviews and Discovering Sentiment"
Language:Python2 0
mihirp1998/GodSpeaks
A parallel version of PeterAnswers made using Node.js
Language:JavaScript1 0
mihirp1998/hostel
Language:HTML1 0
mihirp1998/Kinetic-Dataset
dataset
Language:Python1 0
mihirp1998/lstmHypernetComp
Language:Jupyter Notebook2 0
mihirp1998/pose-tensorflow
Human Pose estimation with TensorFlow framework
Language:C++1 0
mihirp1998/ProbabilisticNeuralProgrammedNetwork
Code for "Probabilistic Neural Programmed Networks for Scene Generation", Deng et al., NIPS 2018
Language:Python2 0
mihirp1998/ProgrammingRNN
RNN Language Modelling on Java Code
Language:Jupyter Notebook1 0
mihirp1998/pytorch-vci_cpg
Language:Python2 0
mihirp1998/pytorch-vcii
Video Compression through Image Interpolation (ECCV'18) [PyTorch]
Language:Python2 0
mihirp1998/Reacher-Continuous-Control
Udacity Deep Reinforcement Learning Nanodegree Program - Continuous Control
Language:ASP
mihirp1998/residual_temp
Language:Jupyter Notebook1 0
mihirp1998/residualconvhypernet
Language:Jupyter Notebook
mihirp1998/SpeakAI
Speaker Verification System
Language:Jupyter Notebook1 01
mihirp1998/Tennis_multiagent-RL
Language:Jupyter Notebook1 0
mihirp1998/tensorflow
Computation using data flow graphs for scalable machine learning
Language:C++1 0
mihirp1998/Tensorflow-Regression-WebApp
Web app made using Flask and TensorFlow for regression prediction
Language:CSS1 0
mihirp1998/Text-Summarize
Language:Jupyter Notebook1 0
mihirp1998/unity-banana-navigation
Project 1 of Udacity Deep Reinforcement Learning Nanodegree
Language:Python2 0
mihirp1998/work_server
Language:C1 0