Pinned Repositories
Genetic_Insects_GPU_AMP
using C++ AMP on Microsoft Windwos for Genetic Algorithm
JoshieGo
A Go playing program implemented in Tensorflow roughly according to the architecture of AlphaGo. Current strength is 3~4 amateur dan.
Machine_learning
The homework of my 3rd-term
Matrixcal
Simple matrix calculation by MFC
simple-CNN
A homework of convolutional neural network
sjm1992st.github.io
Personal certificate
video_seg
视频片段提取
word2vec_win_vs2013
word2vec for vs2013
MillionHeroAssistant
百万 / 冲顶 / 芝士 / UC / 万能 答题助手(知识图谱更加专业,自动推荐答案, Android手机自动屏幕适配,模拟器支持,多开)
wechat_jump_game
微信《跳一跳》Python 辅助
sjm1992st's Repositories
sjm1992st/simple-CNN
A homework of convolutional neural network
sjm1992st/sjm1992st.github.io
Personal certificate
sjm1992st/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
sjm1992st/backtrader
Python Backtesting library for trading strategies
sjm1992st/BianQue
中文医疗对话模型扁鹊(BianQue)
sjm1992st/chatglm2-doctor
sjm1992st/clause
:horse_racing: Chatopera语义理解系统
sjm1992st/DeepRec
DeepRec is a recommendation engine based on TensorFlow.
sjm1992st/Emotional_Chatting
sjm1992st/Firefly
Firefly(流萤): 中文对话式大语言模型(全量微调+QLoRA),支持微调Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya、Bloom等大模型
sjm1992st/git-tips
:trollface:Git的奇技淫巧
sjm1992st/Hyponymy_Hypernym
The hyponymy and hypernym of some noun classes
sjm1992st/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
sjm1992st/Lunar-Solar-Calendar-Converter
公历(阳历)农历(阴历)转换,支持时间段从1900-2100
sjm1992st/models
Models and examples built with TensorFlow
sjm1992st/mydata
sjm1992st/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
sjm1992st/OUCML
sjm1992st/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
sjm1992st/PersonRelationKnowledgeGraph
ChinesePersonRelationGraph, person relationship extraction based on nlp methods.中文人物关系知识图谱项目,内容包括中文人物关系图谱构建,基于知识库的数据回标,基于远程监督与bootstrapping方法的人物关系抽取,基于知识图谱的知识问答等应用。
sjm1992st/PPT_PDF
My Profile
sjm1992st/PromptCBLUE
PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and zero-shot learning in the medical domain in Chinese
sjm1992st/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
sjm1992st/scikit-cuda
Python interface to GPU-powered libraries
sjm1992st/sjm1992st
sjm1992st/stable-diffusion
A latent text-to-image diffusion model
sjm1992st/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
sjm1992st/starcoder
Home of StarCoder: fine-tuning & inference!
sjm1992st/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
sjm1992st/trl
Train transformer language models with reinforcement learning.