VoxBlink2/ScriptsForVoxBlink2

Questions about recording device and distance

Closed this issue · 1 comments

Hello,

When creating this dataset, did you pay attention to the distribution of recording devices and distances?

In addition, the following error is reported when downloading this dataset, so is there any chance that the mirror of this dataset can be provided in the future?

[debug] Command-line config: ['-Uv', '--proxy', 'xxxxx', 'Q0W6wcio384']
[debug] Encodings: locale UTF-8, fs utf-8, pref UTF-8, out utf-8, error utf-8, screen utf-8
[debug] yt-dlp version stable@2024.05.26 from yt-dlp/yt-dlp [ae2af1104] (zip)
[debug] Python 3.8.10 (CPython x86_64 64bit) - Linux-5.15.0-105-generic-x86_64-with-glibc2.29 (OpenSSL 1.1.1f  31 Mar 2020, glibc 2.31)
[debug] exe versions: ffmpeg 4.2.7, ffprobe 4.2.7
[debug] Optional libraries: sqlite3-3.31.1
[debug] Proxy map: {'all': 'xxxx'}
[debug] Request Handlers: urllib
[debug] Loaded 1820 extractors
[debug] Fetching release info: https://api.github.com/repos/yt-dlp/yt-dlp/releases/latest
ERROR: Unable to obtain version info (EOF occurred in violation of protocol (_ssl.c:1131)); Please try again later or visit  https://github.com/yt-dlp/yt-dlp/releases/latest
[youtube] Extracting URL: Q0W6wcio384
[youtube] Q0W6wcio384: Downloading webpage
[youtube] Q0W6wcio384: Downloading ios player API JSON
ERROR: [youtube] Q0W6wcio384: Sign in to confirm you’re not a bot. This helps protect our community. Learn more
  File "/usr/local/bin/yt-dlp/yt_dlp/extractor/common.py", line 734, in extract
    ie_result = self._real_extract(url)
  File "/usr/local/bin/yt-dlp/yt_dlp/extractor/youtube.py", line 4248, in _real_extract
    self.raise_no_formats(reason, expected=True)
  File "/usr/local/bin/yt-dlp/yt_dlp/extractor/common.py", line 1257, in raise_no_formats
    raise ExtractorError(msg, expected=expected, video_id=video_id)

Sorry, these informations are not provided from YouTube.
Now the Youtube platform seems to conduct robot testing occasionally. We cannot provide the raw dataset because of the privacy security, but there is an alternative that you can sleep the process from time to time.
Hope the trick helps you!