Fully-automated scripts for collecting AI-related papers. Support fuzzy and exact search for paper titles.
- [ACL 2019-2021] [EMNLP 2019-2021] [NAACL 2019-2021] [COLING 2020]
- [CVPR 2019-2021] [ECCV 2020] [ICCV2019] [ACMMM 2019-2021]
- [ICLR 2019-2021] [ICML 2019-2021] [AAAI 2019-2021] [IJCAI 2019-2021]
- [SIGIR 2019-2021] [KDD 2019-2021] [CIKM 2019-2021] [WSDM 2019-2022]
- [WWW 2019-2021] [ECIR 2019-2022] [NIPS 2019-2021] [ICASSP 2019-2021]
- [ASE 2019-2021] [ISWC 2019-2021] [MLSys 2020-2022]
Current installation is to clone this repo.
git clone https://github.com/MLNLP-World/AI-Paper-Collector.git
cd AI-Paper-Collector
pip install -r requirements.txt
We provide two usage modes, the first is interactive (main.py
)and the second is command-line (cli_main.py
).
The interactive mode is recommended for the first time users.
To start the interactive, type:
python main.py
Serveral steps to interactively search paper.
- the keyword query
- search mode (exact or fuzzy)
- (fuzzy) threshold
- the limit of results
- a list of conferences, separated by comma
- the file path of the output (top-5 for command preview, all results in this file)
E.g.
[+] Initializing System...
[+] Loading from cache...
[+] Enter your query: few-shot
[+] Select search mode:
[1] Exact
[2] Fuzzy
[+] Enter a number between 1 to 2: 2
[+] Enter threshold between 0 and 100 (default: 50):
[+] Enter limit >= 0 (default: None):
[+] Enter the list of confs separated by comma
E.g. "ACL,CVPR" or "AAAI" or enter nothing for all confs
[+] Enter your list of conferences (default: All Confs): SIGIR,WSDM,CIKM
[+] Search Results:
[=] Only show Top-5, Please Save results to see all.
[1] [CIKM2021] REFORM: Error-Aware Few-Shot Knowledge Graph Completion.
[2] [CIKM2021] Boosting Few-shot Abstractive Summarization with Auxiliary Tasks.
[3] [CIKM2021] Multi-objective Few-shot Learning for Fair Classification.
[4] [CIKM2020] Graph Few-shot Learning with Attribute Matching.
[5] [CIKM2020] Few-shot Insider Threat Detection.
[+] Enter Save filename:
[+] Writing results to output/fuzzy_None_SIGIR_WSDM_CIKM_few-shot.txt
[+] Writing results Done!
For command-line usage, you can use the following commands:
# -q, --query: the input query, and the content with multiple words should be wrapped in quotation marks
# -m, --mode: the search mode: fuzzy or exact, default is exact
# -t, --threshold: the threshold for the fuzzy search, default is 50
# -l, --limit: the limit num of the fuzzy search result, default is None
# -c, --conf: the list of the conferences needs to search, default is all
# -o, --output: the output file name, default is [mode]_[threshold]_[confs]_[query].txt
# -f, --force: force to update the cache file incrementally
python cli_main.py --query QUERY \
[--mode {fuzzy,exact}] \
[--threshold THRESHOLD] [--limit LIMIT] [--conf CONF] \
[--output OUTPUT] [--force]
E.g.
# Note that the input query must be enclosed in `""`, such as "few shot".
python cli_main.py -q "few shot" -m fuzzy -l 10 -t 10 -c AAAI,ACL -o results.txt
If anyone wants to add a new list of conferences. please raise an issue following the format of this one. We will check and label it, then the workflow will run automatically. issue format
- add new conferences by modifying the
conf/dblp_conf.json
file
[
# add the name and dblp_url of the new conf
{
"name": "WWW2021",
"url": "https://dblp.org/db/conf/www/www2021.html"
},
...
]
- run the script
# force to update the cache file incrementally
python cli_main.py --query '' --force
Thanks to the contributors: