SHITIANYU-hue
Ph.D. student @ University of Toronto Reinforcement learning
University of Toronto, Toronto, Canada
SHITIANYU-hue's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
taichi-dev/taichi
Productive, portable, and performant GPU programming in Python.
microsoft/qlib
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms, including supervised learning, market dynamics modeling, and RL.
BlueMatthew/WechatExporter
WeChat Chat History Exporter: a program for exporting and backing up WeChat chat history
OpenBMB/AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
datamllab/rlcard
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
openai/weak-to-strong
Ucas-HaoranWei/Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Yvictor/TradingGym
Trading and backtesting environment for training reinforcement learning agents or simple rule-based algorithms.
OpenMotionLab/MotionGPT
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
TradeMaster-NTU/TradeMaster
TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning
curiousily/Get-Things-Done-with-Prompt-Engineering-and-LangChain
LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects that use a private LLM (Llama 2) for chatting with PDF files and for tweet sentiment analysis.
waymo-research/waymax
A JAX-based simulator for autonomous driving research.
intelligent-environments-lab/CityLearn
Official reinforcement learning environment for demand response and load shaping
wayveai/Driving-with-LLMs
PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"
Raschka-research-group/coral-cnn
Rank Consistent Ordinal Regression for Neural Networks with Application to Age Estimation
robo-alex/awesome-scene-representation
A curated list of awesome scene representation (NeRFs) papers, code, and resources.
CR-Gjx/Suspicion-Agent
The implementation of "Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4"
decisionforce/CoPO
[NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".
i-gallegos/Fair-LLM-Benchmark
kaiwenzha/Rank-N-Contrast
[NeurIPS 2023, Spotlight] Rank-N-Contrast: Learning Continuous Representations for Regression
git-disl/EllipticPlusPlus
Elliptic++ Dataset: A Graph Network of Bitcoin Blockchain Transactions and Wallet Addresses
AltmanD/guandan_mcc
mcc_second_guandan
BorealisAI/ranksim-imbalanced-regression
[ICML 2022] RankSim: Ranking Similarity Regularization for Deep Imbalanced Regression
SmartMobilityAlgorithms/book
This is the site for ECE1724H: Bio-inspired Algorithms for Smart Mobility, https://smartmobilityalgorithms.github.io/book/index.html
gongsixue/DebFace
tml2002/RoleCraft
ethz-msrl/mCR_simulator
This is the companion code to the submission "A Simulation Framework for Magnetic Continuum Robots", Dreyfus R., Boehler Q., Nelson B.J.
alantes/RL-for-MSRs
An implementation of using RL to control magnetic soft robots.
Junang-Wang/Qubot_Elastica