Pinned Repositories
LLM-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
PandaLM
Distant-Supervision
Code for AICS paper: "Multi-level Attention-Based Neural Networks for Distant Supervised Relation Extraction"
Explainable-Financial-Text-Classification
Repository for COLING-20 Paper: Generating Plausible Counterfactual Explanations for Deep Transformers in Financial Text Classification
Fine-grained-Causal-Reasoning
We introduce a novel fine-grained causal reasoning dataset and present a series of novel tasks in NLP, from causality detection to event causality extraction and Causal QA. Our dataset contains human annotations of 25K cause-effect event pairs and 24K question-answering pairs within multi-sentence samples.
FinNLP-Progress
NLP progress in Fintech. A repository to track the progress in Natural Language Processing (NLP) related to the domain of Finance, including the datasets, papers, and current state-of-the-art results for the most popular tasks.
GLUE-X
We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that the OOD accuracy in NLP tasks needs to be paid more attention to since the significant performance decay compared to ID accuracy has been found in all settings.
HTML-Hierarchical-Transformer-based-Multi-task-Learning-for-Volatility-Prediction
Code for WWW-20 Paper: HTML: Hierarchical Transformer-based Multi-task Learning for Volatility Prediction
Supervised-Knowledge-Makes-Large-Language-Models-Better-In-context-Learners
Repository for ICLR 2024 Paper: Supervised Knowledge Makes Large Language Models Better In-context Learners. We offer a comprehensive suite of resources, including 16 curated datasets, prompts, model checkpoints, and LLM outputs across 9 distinct tasks.
YangLinyi's Repositories
YangLinyi/FinNLP-Progress
NLP progress in Fintech. A repository to track the progress in Natural Language Processing (NLP) related to the domain of Finance, including the datasets, papers, and current state-of-the-art results for the most popular tasks.
YangLinyi/GLUE-X
We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that the OOD accuracy in NLP tasks needs to be paid more attention to since the significant performance decay compared to ID accuracy has been found in all settings.
YangLinyi/HTML-Hierarchical-Transformer-based-Multi-task-Learning-for-Volatility-Prediction
Code for WWW-20 Paper: HTML: Hierarchical Transformer-based Multi-task Learning for Volatility Prediction
YangLinyi/Fine-grained-Causal-Reasoning
We introduce a novel fine-grained causal reasoning dataset and present a series of novel tasks in NLP, from causality detection to event causality extraction and Causal QA. Our dataset contains human annotations of 25K cause-effect event pairs and 24K question-answering pairs within multi-sentence samples.
YangLinyi/Explainable-Financial-Text-Classification
Repository for COLING-20 Paper: Generating Plausible Counterfactual Explanations for Deep Transformers in Financial Text Classification
YangLinyi/Supervised-Knowledge-Makes-Large-Language-Models-Better-In-context-Learners
Repository for ICLR 2024 Paper: Supervised Knowledge Makes Large Language Models Better In-context Learners. We offer a comprehensive suite of resources, including 16 curated datasets, prompts, model checkpoints, and LLM outputs across 9 distinct tasks.
YangLinyi/Leveraging-BERT-to-Improve-the-FEARS-Index-for-Stock-Forecasting
Leveraging BERT to Improve the FEARS Index for Stock Forecasting
YangLinyi/SVM-CNN-RNN-HAN-Popular-NLP-Models
Four easy to use baselines for NLP research
YangLinyi/FinTrust
Code for ACL-2023 paper "Measuring Consistency in Text-based Financial Forecasting Models"
YangLinyi/a-rationale-centric-framework-for-human-in-the-loop-machine-learning
Implementation for ACL 2022 paper: A Rationale-Centric Framework For Human-in-the-loop Machine Learning.
YangLinyi/XAI-Dataset-and-Benchmark
This repo aims to provide an easy-to-use and robust benchmark to a broad range of NLP tasks for evaluating XAI methods.
YangLinyi/academic-kickstart-linyi
YangLinyi/Counterfactuals-for-Sentiment-Analysis
Code and Data for ACL-21 paper: "Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis"
YangLinyi/DLQuant
Applying Deep Learning and NLP in Quantitative Trading
YangLinyi/EarningsCall_Dataset
The earnings conference call dataset of S&P 500 companies
YangLinyi/FactMix
Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"
YangLinyi/Grid_Search_Tutorial_for_Stock_Forecasting
A simple tutorial for using the grid search to find the optimal hyper-parameters
YangLinyi/llm.c
LLM training in simple, raw C/CUDA
YangLinyi/MAEC-A-Multimodal-Aligned-Earnings-Conference-Call-Dataset-for-Financial-Risk-Prediction
Repository for CIKM 2020 resource track paper: MAEC: A Multimodal Aligned Earnings Conference Call Dataset for Financial Risk Prediction
YangLinyi/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
YangLinyi/yanglinyi.github.io
Personal Website
YangLinyi/academic-kickstart
My Academic Homepage
YangLinyi/bert-as-service
Mapping a variable-length sentence to a fixed-length vector using BERT model
YangLinyi/FinancialDatasets
SmoothNLP 金融文本数据集(公开) Public Financial Datasets for NLP Researches Only
YangLinyi/hugo-academic
📝 The website builder for Hugo. Build and deploy a beautiful website in minutes!
YangLinyi/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
YangLinyi/lstm
YangLinyi/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
YangLinyi/PAlign
Personality Alignment of Language Models
YangLinyi/Semi-supervised-learning
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)