Pinned Repositories
trl
Train transformer language models with reinforcement learning.
Adversarial-Inverse-Graphics-Networks-for-Faces
AlignProp
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion
Diffusion-TTA
Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.
Disentangling-3D-Prototypical-Nets
We present neural architectures that disentangle RGB-D images into objects' shapes and styles and a map of the background scene, and explore their applications for few-shot 3D object detection and few-shot concept classification.
EmbLang
Embodied Language Grounding With 3D Visual Feature Representations
Navigation-Deep-RL
This is a rep for navigating in unity environment using deep q learning network in pytroch
ProbabilisticNeuralProgrammedNetwork_Tensorflow
Code for "Probabilistic Neural Programmed Networks for Scene Generation.", Deng et al, NIPS 2018. This Code Base is ported Tensorflow 2.0 version of the official Pytorch Implementation
Slot-TTA
Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.
VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.
mihirp1998's Repositories
mihirp1998/Navigation-Deep-RL
This is a rep for navigating in unity environment using deep q learning network in pytroch
mihirp1998/flask-webcam-recorder
A client server repository for collecting video from the user and storing it on server side
mihirp1998/Association-Net
This is an unofficial implementation of the Paper "Learning Feature Hierarchies from Long-Range Temporal Associations in Videos" By Panna Felsen, Katerina Fragkiadaki, Jitendra Malik, Alexei Efros
mihirp1998/continuous-control
Udacity deep reinforcement learning continuous control project
mihirp1998/Continuous-control-using-Policy-Based-RL-Method
mihirp1998/convHypernetComp_xav
mihirp1998/cs224d
Code for Stanford CS224D: deep learning for natural language understanding
mihirp1998/deep-reinforcement-learning
Repo for the Deep Reinforcement Learning Nanodegree program
mihirp1998/FaceCropper
mihirp1998/FB-Bot
Fbbot for Home Automation made using nodejs
mihirp1998/generating-reviews-discovering-sentiment
Code for "Learning to Generate Reviews and Discovering Sentiment"
mihirp1998/GodSpeaks
A parallel version of PeterAnswers made using Nodejs
mihirp1998/hostel
mihirp1998/Kinetic-Dataset
dataset
mihirp1998/lstmHypernetComp
mihirp1998/pose-tensorflow
Human Pose estimation with TensorFlow framework
mihirp1998/ProbabilisticNeuralProgrammedNetwork
Code for "Probabilistic Neural Programmed Networks for Scene Generation.", Deng et al, NIPS 2018
mihirp1998/ProgrammingRNN
RNN Language Modelling on Java Code
mihirp1998/pytorch-vci_cpg
mihirp1998/pytorch-vcii
Video Compression through Image Interpolation (ECCV'18) [PyTorch]
mihirp1998/Reacher-Continuous-Control
Udacity Deep Reinforcement Learning Nanodegree Program - Continuous Control
mihirp1998/residual_temp
mihirp1998/residualconvhypernet
mihirp1998/SpeakAI
Speaker Verification System
mihirp1998/Tennis_multiagent-RL
mihirp1998/tensorflow
Computation using data flow graphs for scalable machine learning
mihirp1998/Tensorflow-Regression-WebApp
Webapp made using flask and tensorflow for regression prediction
mihirp1998/Text-Summarize
mihirp1998/unity-banana-navigation
Project 1 of Udacity Deep Reinforcement Learning Nanodegree
mihirp1998/work_server