Hideki105

数理工学

Tokyo

Pinned Repositories

ADMMBO
An implementation of "ADMMBO, An ADMM Framework for Bayesian Optimization with Unknown Constraints''
Language:MATLAB0 0 00
AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Language:Jupyter Notebook0 0 00
botorch
Bayesian optimization in PyTorch
Language:Jupyter Notebook0 0 00
Bottleneck-Minimal-Indexing
Code for the paper "Bottleneck Minimal Indexing for Generative Document Retrieval" accepted by ICML 2024
Language:Python0 0 00
bregman-proximal-dc-algorithm
Bregman Proximal type algorithms
Language:Python0 0 00
cet
CET: Counterfactual Explanation Tree [AISTATS-22]
Language:Python0 0 00
CGMRES-3languages
Language:C++0 0 00
CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.
Language:Python0 0 00
Inverse_Reinforcement_Learning
逆強化学習のサンプル
1 2 00
mathematical-engineering
数理工学の講義資料
3 1 00

Hideki105's Repositories

Hideki105/mathematical-engineering
数理工学の講義資料
3 1 00
Hideki105/Inverse_Reinforcement_Learning
逆強化学習のサンプル
1 2 00
Hideki105/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Language:Jupyter Notebook0 0 00
Hideki105/botorch
Bayesian optimization in PyTorch
Language:Jupyter Notebook0 0 00
Hideki105/Bottleneck-Minimal-Indexing
Code for the paper "Bottleneck Minimal Indexing for Generative Document Retrieval" accepted by ICML 2024
Language:Python0 0 00
Hideki105/bregman-proximal-dc-algorithm
Bregman Proximal type algorithms
Language:Python0 0 00
Hideki105/cet
CET: Counterfactual Explanation Tree [AISTATS-22]
Language:Python0 0 00
Hideki105/CGMRES-3languages
Language:C++0 0 00
Hideki105/dace
DACE: Distribution-Aware Counterfactual Explanation [IJCAI-20]
Language:Python0 0 00
Hideki105/deep-learning-from-scratch-4
ゼロから作るDeep Learning④強化学習編
Language:Jupyter Notebook0 0 00
Hideki105/Diffusion-Models-pytorch
Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
Language:Python0 0 00
Hideki105/Diffusion-Policies-for-Offline-RL
Language:Python00
Hideki105/linear-programming
主双対内点法による線形計画法
Language:Python00
Hideki105/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
Hideki105/gpytorch
A highly efficient implementation of Gaussian Processes in PyTorch
Hideki105/graduate_exam
京都大学数学系の院試の問題と解答です
Language:TeX0 0
Hideki105/GraduateSchoolEntranceExamination
東京大学大学院情報理工学系研究科入試問題過去問解答など
Language:TeX0 0
Hideki105/kotaemon
An open-source RAG-based tool for chatting with your documents.
Language:Python0 0
Hideki105/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Language:Python0 0
Hideki105/manifold-optimization-book
『多様体上の最適化理論』サポートページ
Hideki105/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Language:Python0 0
Hideki105/mogp
Mixture of Gaussian Processes Model for Sparse Longitudinal Data
Hideki105/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python0 0
Hideki105/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Language:Python0 0
Hideki105/python_simple_mppi
Python implementation of MPPI (Model Predictive Path-Integral) controller to understand the basic idea. Mandatory dependencies are numpy and matplotlib only.
Hideki105/riemannian-optimization
リーマン多様体上の最適化
Language:Python1 0
Hideki105/robust-optimization
ロバスト最適化
Language:Python
Hideki105/robustOT
Robust Optimal Transport code
Hideki105/sam
SAM: Sharpness-Aware Minimization (PyTorch)
Hideki105/self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.