A template for starting a new allennlp project using config files and allennlp train
. For simple
projects, all you need to do is get your model code into the class in my_project/model.py
, your
data loading code into the DatasetReader
code in my_project/dataset_reader.py
, and that's it,
you can train a model (we recommend also making appropriate changes to the test code, and using that
for development, but that's optional).
See the AllenNLP Guide for a quick start on how to use what's in this example project. We're grabbing the model and dataset reader classes from that guide. You can replace those classes with a model and dataset reader for whatever you want (including copying code from our model library as a starting point). The very brief version of what's in here:
-
A
Model
class inmy_project/model.py
. -
A
DatasetReader
class inmy_project/dataset_reader.py
. -
Tests for both of these classes in
tests
, including a small toy dataset that can be read. We strongly recommend that you use a toy dataset with tests like this during model development, for quick debug cycles. To run the tests, just runpytest
from the base directory of this project. -
An
.allennlp_plugins
file, which makes it easier to useallennlp train
with this project. If you change the name ofmy_project
, you should also change the line in this file to match. -
An example configuration file for training the model, which you would use with the following shell command from the base directory of this repository, after
pip install allennlp
:allennlp train -s /dir/to/save/results training_config/my_model_trained_on_my_dataset.json