shirayu/whispering

Duplicated output bug

shirayu opened this issue · 2 comments

Sometimes duplicated results are shown.

80.40->84.40    もうこのすればの方をつけるために
[2022-09-23 23:29:33,283] transcriber._deal_timestamp DEBUG -> Length of buffer: 0
[2022-09-23 23:29:33,283] transcriber.transcribe DEBUG -> Last rest_start=None
[2022-09-23 23:29:33,283] cli.transcribe_from_mic DEBUG -> Segment: 12
[2022-09-23 23:29:33,284] transcriber.transcribe DEBUG -> seek=0, timestamp=84.4, rest_start=None
[2022-09-23 23:29:34,192] transcriber.transcribe DEBUG -> Result: temperature=0.00, no_speech_prob=0.25, avg_logprob=-0.47
[2022-09-23 23:29:34,192] transcriber._deal_timestamp DEBUG -> Length of consecutive: 0
84.40->88.40    もうこのすればの方をつけるために
[2022-09-23 23:29:34,192] transcriber._deal_timestamp DEBUG -> Length of buffer: 0
[2022-09-23 23:29:34,192] transcriber.transcribe DEBUG -> Last rest_start=None
[2022-09-23 23:29:34,192] cli.transcribe_from_mic DEBUG -> Segment: 13
[2022-09-23 23:29:34,193] transcriber.transcribe DEBUG -> seek=0, timestamp=88.4, rest_start=None
[2022-09-23 23:29:35,097] transcriber.transcribe DEBUG -> Result: temperature=0.00, no_speech_prob=0.25, avg_logprob=-0.47
[2022-09-23 23:29:35,097] transcriber._deal_timestamp DEBUG -> Length of consecutive: 0
88.40->92.40    もうこのすればの方をつけるために
[2022-09-23 23:29:35,098] transcriber._deal_timestamp DEBUG -> Length of buffer: 0
[2022-09-23 23:29:35,098] transcriber.transcribe DEBUG -> Last rest_start=None
[2022-09-23 23:29:35,098] cli.transcribe_from_mic DEBUG -> Segment: 14
[2022-09-23 23:29:35,099] transcriber.transcribe DEBUG -> seek=0, timestamp=92.4, rest_start=None
[2022-09-23 23:29:35,994] transcriber.transcribe DEBUG -> Result: temperature=0.00, no_speech_prob=0.25, avg_logprob=-0.47
[2022-09-23 23:29:35,994] transcriber._deal_timestamp DEBUG -> Length of consecutive: 0
92.40->96.40    もうこのすればの方をつけるために

This may be related to openai/whisper#29

As author answered, this is caused a limitation of wihsper.
openai/whisper#29 (comment)

#5 is needed to relieve this problem.