- 因子图部分链接尚未补充完全

TODO List

机器学习

模式识别

图书

《Bishop Pattern Recognition and Machine Learning》 by Christopher M. Bishop
《PRML：模式识别与机器学习(中文版)》 by 马春鹏

深度学习

图书

《Deep Learning》 by Ian Goodfellow, Yoshua Bengio, Aaron Courville
《深度学习-中文》 by Ian Goodfellow, Yoshua Bengio, Aaron Courville Git Hub开源翻译
《神经网络与深度学习》 by 邱锡鹏

强化学习

图书

《Reinforcement Learning: An Inroduction》 by Richard S. Sutton, Andrew G. Barto [website]
简短翻译版-强化学习导论.pdf 来自网友
《Algorithms for Reinforcement Learning》 by Csaba Szepesv´ari
《A Concise Introduction to Decentrakuzed POMDPs》 by Oliehoed, Amato

课程

李宏毅-主页, 强化学习课程视频

优质笔记(https://datawhalechina.github.io/easy-rl/)

David Silver主页, 课程视频

伯克利2018强化学习课程

实战

基于SMAC的PYMARL平台 GitHub 地址https://github.com/oxwhirl/pymarl
百度PaddlePaddle工程师实训教程--视频https://www.bilibili.com/video/BV1yv411i7xd

Baidu AI-Studio课程
项目代码PRAL GitHub

扩展知识

POMDPs介绍--Pages

论文

基于值函数的强化学习方法

动态规划算法
蒙特卡罗算法
时序差分学习方法

Sarsa 和Q-learning：https://zhuanlan.zhihu.com/p/46850008

基于策略的强化学习方法

策略梯度

特点：处理连续动作和随机策略

介绍：Policy Gradient Methods for Reinforcement Learning with Function Approximation
Reinforce算法
带基线的Reinforce算法

特点：减少方差

基于值函数和策略的结合

Actor-Critic算法

特点：使用Q函数减少方差

介绍：Policy Gradient Methods for Reinforcement Learning with Function Approximation
A2C

特点：使用优势函数减少方差

介绍：https://openai.com/blog/baselines-acktr-a2c/
A3C

特点：多线程

介绍：Asynchronous Methods for Deep Reinforcement Learning.

以上三节参考：强化学习value-based&policy-based.pptx

深度强化学习

DQN

介绍：Playing Atari with Deep Reinforcement Learning
Nature DQN

介绍：Human-level control through deep reinforcement learning
Double DQN (DDQN)

介绍：Deep Reinforcement Learning with Double Q-learning
Dueling DQN

介绍：Dueling Network Architectures for Deep Reinforcement Learning

DQN及其变体介绍：https://zhuanlan.zhihu.com/p/106411995
DPG

介绍：Deterministic Policy Gradient Algorithms
DDPG

介绍：Continuous Control with Deep Reinforcement Learning
MADDPG

介绍：multi-agent actor-critic for mixed cooperative-competitive environments

RL热点问题

因子图（Factor Graph）

1 因子图与和积算法

相关网页
概率图的推断——变量消除、信念传播、因子图、道德图、联结树
以一个例子讲述因子图为何以及如何进行计算。
因子图与和积算法简介(CSDN)
出自论文factor graph and sum-product algorithm
因子图与和积算法简介(知乎)
出自论文factor graph and sum-product algorithm与上一个链接内容相比，对因子图定义的形式化描述更多。

相关论文
An introduction to factor graph
本文讲述因子图的发展过程，并给出两种形式的因子图：标准形式、Forney形式。介绍了LDPC码、卡尔曼滤波等应用与因子图上的例子。
因子图与和积算法简介(CSDN)
出自论文factor graph and sum-product algorithm。
因子图与和积算法简介(知乎)
出自论文factor graph and sum-product algorithm。与上一个链接内容相比，对因子图定义的形式化描述更多。

其他材料

SRTP因子图项目报告
课题名为：“实现信息融合的因子图可视化设计”。描述了因子图定义以及各种算法，并进行仿真实验设计

2 信念传播

2.1 信念传播算法

2.2 循环信念传播

其他材料

Metacademy课程:循环信念传播与变分推理

metacademy是一个网站，其可以看作机器学习和人工智能的知识图谱

[word文档]LBP论文笔记

简单介绍了LBP算法，并简单推导了Loopy belief propagation based data association for extended target tracking中的部分因子图

3 因子图代码实现

3.1 matlab代码实现

3.2 Julia代码实现

3.2.1 Julia安装

其他材料

[word文档]Julia安装流程

对网上的安装流程做出总结，给出了几个可行的安装方法

3.2.2 forneyLab工具箱的使用

4 因子图扩展

4.1 因子图约束

4.2 BP算法的粒子化

其他材料

[ppt文档]Understanding and Accelerating Particle-Based Variational Inference的讲解

4.3 因子图与粒子滤波

4.4 因子图与协同网络

其他材料

[rar文件]PLBP算法的代码实现？

SICC-Group/Learning-Materials

机器学习

模式识别

图 书

深度学习

图 书

强化学习

图 书

课 程

实 战

扩展知识

论 文

综 述

算 法

强化学习基本概念

基于值函数的强化学习方法

基于策略的强化学习方法

基于值函数和策略的结合

深度强化学习

RL热点问题

因子图 （Factor Graph）

1 因子图与和积算法

其他材料

2 信念传播

2.1 信念传播算法

相关论文

2.2 循环信念传播

相关论文

其他材料

3 因子图代码实现

3.1 matlab代码实现

相关网页

3.2 Julia代码实现

3.2.1 Julia安装

相关网页

相关论文

其他材料

3.2.2 forneyLab工具箱的使用

相关网页

相关论文

4 因子图扩展

4.1 因子图约束

相关论文

4.2 BP算法的粒子化

相关论文

其他材料

4.3 因子图与粒子滤波

相关论文

4.4 因子图与协同网络

相关论文

其他材料

图书

图书

图书

课程

实战

论文

综述

算法

因子图（Factor Graph）