flairNLP/fabricator
[EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.
PythonApache-2.0
Issues
- 0
Failing CI pytest
#85 opened by HallerPatrick - 2
Supporting other open-source LLMs
#83 opened by mzeidhassan - 0
Remove spacy dependency
#55 opened by whoisjones - 0
Initial version for pypi package
#73 opened by whoisjones - 1
Naming of sampling strategies
#77 opened by whoisjones - 1
after accepted: create pypi package
#6 opened by fhamborg - 1
Cannot import DatasetGenerator
#74 opened by demongolem-biz2 - 2
Large Git Pack Files
#71 opened by HallerPatrick - 2
Implement ProGen approach
#67 opened by whoisjones - 0
Add Fabriator Predictions to Wrench Datasets
#70 opened by alanakbik - 0
- 1
- 1
Add predicted labels to CoNLL-03 dataset
#66 opened by alanakbik - 0
Improve fewshot sampling naming convention
#58 opened by whoisjones - 0
Rename to Fabricator
#60 opened by whoisjones - 0
improve logo + fix width in readme
#62 opened by whoisjones - 2
Split haystack dependencies
#56 opened by whoisjones - 0
for evaluation: compare against baseline that just k times repeats the few shot examples
#22 opened by fhamborg - 0
- 1
Reduce dependencies
#52 opened by whoisjones - 2
enable user to provide 1-sentence description for each label, which will be shown at the beginning of the prompt to the LLM
#17 opened by fhamborg - 0
Refactorings for submission
#42 opened by whoisjones - 2
- 0
Rename source directory to "src"
#3 opened by HallerPatrick - 0
More log infos
#35 opened by HallerPatrick - 0
cost reduction: generate and annotate within a single prompt (instead of first generating and then annotating)
#29 opened by fhamborg - 3
- 0
- 0
- 2
How to properly differentiate between generate unlabeled and annotate unlabeled data?
#11 opened by whoisjones - 0
Naming convention for DataPoints / Prompts
#8 opened by whoisjones - 0
Sampling method that covers all labels
#5 opened by HallerPatrick