revdotcom/speech-datasets

Transcript issues for 4363614 in earnings-21

huangruizhe opened this issue · 0 comments

Good|0||||UC|[]|[]
day|0||||LC|[]|[]
everyone|0|||,|LC|[]|[]
and|0||||LC|[]|[]
welcome|0||||LC|[]|[]
to|0||||LC|[]|[]
<unk>|0||||LC|[]|[]
second|0||||LC|[]|['4']
quarter|0||||LC|[]|['4']
2020|0||||CA|['0:YEAR']|['0', '1', '4']
earnings|0||||LC|[]|[]
conference|0||||LC|[]|[]
call|0|||.|LC|[]|[]
Today's|0||||UC|[]|[]
call|0||||LC|[]|[]
is|0||||LC|[]|[]
being|0||||LC|[]|[]
recorded|0|||.|LC|[]|[]
Following|0||||UC|[]|[]
the|0||||LC|[]|[]
speaker's|0||||LC|[]|[]
remarks|0|||,|LC|[]|[]
there|0||||LC|[]|[]
will|0||||LC|[]|[]
be|0||||LC|[]|[]
a|0||||LC|[]|[]
question|0||||LC|[]|[]
and|0||||LC|[]|[]
answer|0||||LC|[]|[]
session|0|||.|LC|[]|[]
I'd|0||||UC|['2:CONTRACTION']|['2']
now|0||||LC|[]|[]
like|0||||LC|[]|[]
to|0||||LC|[]|[]
turn|0||||LC|[]|[]
the|0||||LC|[]|[]
conference|0||||LC|[]|[]
over|0||||LC|[]|[]
to|0||||LC|[]|[]
Mr|0|||.|UC|[]|[]
<inaudible>|0|||,|LC|[]|[]
Managing|0||||UC|[]|[]
Director|0||||UC|[]|[]

It seems the transcript has some issues, as quoted above. For example, <unk> appears in place of the company's name and <inaudible> in place of a person's name.
This can be checked against here
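
To find every such spot in a transcript rather than checking by eye, something like the following sketch might help. It assumes each row has eight pipe-separated fields, as in the rows quoted above. The column names in `FIELDS` and the helper `find_placeholders` are my own labels for illustration, not something confirmed by the dataset.

```python
# Column names are an assumption inferred from the quoted rows
# (eight pipe-separated fields per token); they are labels only.
FIELDS = ["token", "speaker", "ts", "end_ts",
          "punctuation", "case", "tags", "wer_tags"]

# Placeholder tokens reported in this issue.
PLACEHOLDERS = {"<unk>", "<inaudible>"}


def find_placeholders(lines):
    """Return (line_number, token) for each row whose token is a placeholder."""
    hits = []
    for line_no, line in enumerate(lines, start=1):
        line = line.strip()
        if not line:
            continue
        parts = line.split("|")
        if len(parts) != len(FIELDS):
            # Skip rows that do not match the assumed eight-field layout.
            continue
        row = dict(zip(FIELDS, parts))
        if row["token"] in PLACEHOLDERS:
            hits.append((line_no, row["token"]))
    return hits


# A few rows copied from the excerpt quoted in this issue.
sample = """\
to|0||||LC|[]|[]
<unk>|0||||LC|[]|[]
second|0||||LC|[]|['4']
Mr|0|||.|UC|[]|[]
<inaudible>|0|||,|LC|[]|[]
"""

hits = find_placeholders(sample.splitlines())
print(hits)  # [(2, '<unk>'), (5, '<inaudible>')]
```

Running the same function over a whole transcript file (for example `find_placeholders(open(path))`, with `path` pointing at a local copy) would list every placeholder token with its line number.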