Pinned Repositories
Chinese-Essay-Dataset-For-Organization-Evaluation
A Chinese argumentative student essay dataset for Organization Evaluation and Discourse Element Identification
Chinese_Semantic_Dependency_Parser_with_knowledge
classroom-management-system
a web system to manage classroom
Interface-demo-of-Producer-consumer-problem
A dynamics Demonstration
nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
PhraseExtractor
Semantic-dependency-corpus_process_codes
Text-similarity-comparison-and-clustering-system
Used JAVA
Topic_and_user_profile_analysis_system
基于微博的网络舆情话题分析和用户画像系统
Web-browser-developed-by-javafx
a simple browser
a101269's Repositories
a101269/PhraseExtractor
a101269/Web-Service-QoS-data-acquiring-manage-and-predict-platform
Web-Service-QoS-data-acquiring-manage-and-predict-platform,developed by Django.
a101269/agentlego
a101269/AI_Chinese_DataSet_KnowledgeDAO
供AI训练的中文数据集(持续更新。。。)与AI公司图谱,目前的数据集餐饮行业8000问,百度知道,Alpaca中文数据集,计算机领域数据集,Vicuna数据集,RedPajama数据集,Wikipedia中文词条数据集,网站论坛问答数据集
a101269/AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
a101269/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
a101269/bias-bench
ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.
a101269/ChatGLM-6B
ChatGLM-6B:开源双语对话语言模型
a101269/DatasMark
离线版中文标注工具,支持NER、文本分类、关系标注、对话标注等。
a101269/Debias
The code of "Take its Essence, Discard its Dross! Debiasing for Toxic Language Detection via Counterfactual Causal Effect" (LREC-COLING2024).
a101269/Detox-CoT
a101269/DExperts
code associated with ACL 2021 DExperts paper
a101269/doc-story-generation
a101269/LLM-IR-Bias-Fairness-Survey
This is the repo for the survey of Bias and Fairness in IR with LLMs.
a101269/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
a101269/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
a101269/MoneyPrinter
Automate Creation of YouTube Shorts using MoviePy.
a101269/MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
a101269/MRC_Competition_Dureader
机器阅读理解 冠军/亚军代码及中文预训练MRC模型
a101269/OPO
a101269/PPLM
Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.
a101269/RLHF
Implementation of Chinese ChatGPT
a101269/RLHF-Label-Tool
用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.
a101269/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
a101269/Safety-Prompts
Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。
a101269/Summarization-Papers
Summarization Papers
a101269/ToSarcasm
Dataset (ToSarcasm) and Code (TOSPrompt) for CCL 2022 best paper: 面向话题的讽刺识别:新任务、新数据和新方法(Topic-Oriented Sarcasm Detection: New Task, New Dataset and New Method)
a101269/transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF etc.
a101269/Unified-Debiasing-and-Detoxifying
Code for the ICLR 2023 paper: Unified Detoxifying and Debiasing in Language Generation via Inference-time Adaptive Optimization
a101269/WebCPM
Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"