akanyaani/gpt-2-tensorflow2.0

Performance issues in data_pipeline.py(P2)

DLPerf opened this issue · 0 comments

Hello,I found a performance issue in the definition of input_fn ,
data_pipeline.py,
dataset = dataset.map(parse_example) was called without num_parallel_calls.
I think it will increase the efficiency of your program if you add this.

Here is the documemtation of tensorflow to support this thing.

Looking forward to your reply. Btw, I am very glad to create a PR to fix it if you are too busy.