Pinned Repositories
ADMMBO
An implementation of "ADMMBO, An ADMM Framework for Bayesian Optimization with Unknown Constraints"
AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
botorch
Bayesian optimization in PyTorch
Bottleneck-Minimal-Indexing
Code for the paper "Bottleneck Minimal Indexing for Generative Document Retrieval" accepted by ICML 2024
bregman-proximal-dc-algorithm
Bregman Proximal type algorithms
cet
CET: Counterfactual Explanation Tree [AISTATS-22]
CGMRES-3languages
CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.
Inverse_Reinforcement_Learning
Inverse reinforcement learning samples
mathematical-engineering
Lecture materials for mathematical engineering
Hideki105's Repositories
Hideki105/mathematical-engineering
Lecture materials for mathematical engineering
Hideki105/Inverse_Reinforcement_Learning
Inverse reinforcement learning samples
Hideki105/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Hideki105/botorch
Bayesian optimization in PyTorch
Hideki105/Bottleneck-Minimal-Indexing
Code for the paper "Bottleneck Minimal Indexing for Generative Document Retrieval" accepted by ICML 2024
Hideki105/bregman-proximal-dc-algorithm
Bregman Proximal type algorithms
Hideki105/cet
CET: Counterfactual Explanation Tree [AISTATS-22]
Hideki105/CGMRES-3languages
Hideki105/dace
DACE: Distribution-Aware Counterfactual Explanation [IJCAI-20]
Hideki105/deep-learning-from-scratch-4
Deep Learning from Scratch 4: Reinforcement Learning (ゼロから作るDeep Learning④強化学習編)
Hideki105/Diffusion-Models-pytorch
PyTorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
Hideki105/Diffusion-Policies-for-Offline-RL
Hideki105/linear-programming
Linear programming via the primal-dual interior-point method
Hideki105/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
Hideki105/gpytorch
A highly efficient implementation of Gaussian Processes in PyTorch
Hideki105/graduate_exam
Problems and solutions from the Kyoto University graduate entrance exams in mathematics
Hideki105/GraduateSchoolEntranceExamination
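Past entrance exam problems, solutions, and related material for the University of Tokyo Graduate School of Information Science and Technology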
Hideki105/kotaemon
An open-source RAG-based tool for chatting with your documents.
Hideki105/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Hideki105/manifold-optimization-book
Support page for the book 『多様体上の最適化理論』 (Optimization Theory on Manifolds)
Hideki105/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Hideki105/mogp
Mixture of Gaussian Processes Model for Sparse Longitudinal Data
Hideki105/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Hideki105/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Hideki105/python_simple_mppi
Python implementation of MPPI (Model Predictive Path-Integral) controller to understand the basic idea. Mandatory dependencies are numpy and matplotlib only.
Hideki105/riemannian-optimization
Optimization on Riemannian manifolds
Hideki105/robust-optimization
Robust optimization
Hideki105/robustOT
Robust Optimal Transport code
Hideki105/sam
SAM: Sharpness-Aware Minimization (PyTorch)
Hideki105/self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.