voc/voctoweb

Invalid entry in https://api.media.ccc.de/public/events/recent

litetex opened this issue · 1 comment

Hi all,
we recently noticed an error inside NewPipeExtractor.

It looks like https://api.media.ccc.de/public/events/recent is returning a single event/video that is corrupt:

{
    "guid": "c78934cb-fd2b-4b7e-bc3a-0e530e8161f4",
    "title": "Demonight",
    "subtitle": null,
    "slug": "dgwk2022-4163-demonight",
    "link": "https://cfp.winterkongress.ch/wk22/talk/8W7FN3/",
    "description": "Demonight von Echtzeit – Digitale Kultur\n\n",
    "original_language": "deu",
    "persons": [
        "Echtzeit – Digitale Kultur"
    ],
    "tags": [
        "import",
        "4163",
        "2022",
        "Andere"
    ],
    "view_count": 0,
    "promoted": false,
    "date": "2022-02-26T21:15:00.000+01:00",
    "release_date": "2022-03-10T00:00:00.000+01:00",
    "updated_at": "2022-03-10T06:25:30.016+01:00",
    "length": 0,
    "duration": 0,
    "thumb_url": "https://static.media.ccc.de/media/conferences/dgwk/2022/4163-c78934cb-fd2b-4b7e-bc3a-0e530e8161f4.jpg",
    "poster_url": "https://static.media.ccc.de/media/conferences/dgwk/2022/4163-c78934cb-fd2b-4b7e-bc3a-0e530e8161f4_preview.jpg",
    "timeline_url": "https://static.media.ccc.de/media/conferences/dgwk/2022/4163-c78934cb-fd2b-4b7e-bc3a-0e530e8161f4.timeline.jpg",
    "thumbnails_url": "https://static.media.ccc.de/media/conferences/dgwk/2022/4163-c78934cb-fd2b-4b7e-bc3a-0e530e8161f4.thumbnails.vtt",
    "frontend_link": "https://media.ccc.de/v/dgwk2022-4163-demonight",
    "url": "https://api.media.ccc.de/public/events/c78934cb-fd2b-4b7e-bc3a-0e530e8161f4",
    "conference_title": "Winterkongress 2022",
    "conference_url": "https://api.media.ccc.de/public/conferences/dgwk2022",
    "related": []
}

Most URLs (e.g. for thumbnails) are not working, and the length and duration fields are both 0.

Currently this is returned at index 68 of https://api.media.ccc.de/public/events/recent.

Is this intended behavior or are we e.g. using the wrong endpoint?
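Until that is clarified, one possible client-side workaround is to skip entries that look incomplete. The following is only a minimal sketch, assuming that a zero length and duration reliably marks an event whose files are not available yet (this is a guess based on the entry above, not documented API behavior). The helper names and the second sample entry are hypothetical, and the sample list stands in for the parsed list of events from the endpoint.

```python
from typing import Any


def is_playable(event: dict[str, Any]) -> bool:
    """Heuristic (assumption): an event is usable once it reports a non-zero runtime."""
    # Missing or null values are treated like 0.
    length = event.get("length") or 0
    duration = event.get("duration") or 0
    return length > 0 and duration > 0


def filter_playable(events: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Drop entries that look like they were created before their files existed."""
    return [event for event in events if is_playable(event)]


# Stand-in for the parsed events; only the fields the check needs are shown.
sample_events = [
    {"slug": "dgwk2022-4163-demonight", "length": 0, "duration": 0},
    {"slug": "hypothetical-complete-talk", "length": 7292, "duration": 7292},
]

playable = filter_playable(sample_events)
```

This only hides the symptom on the consumer side; it does not explain why the entry is served in this state.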

Hmm, right now everything looks fine – probably a "race" condition where the event was already added to the database, but the files were not yet uploaded...

We could change https://github.com/voc/voctopublish/blob/production/voctopublish/voctopublish.py#L188-L204 to first upload the files, and then create the event...
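Roughly, the reordering could look like the sketch below. This is not the actual voctopublish code: `publish`, `upload_files`, and `create_event` are hypothetical names used only to illustrate the ordering, and the real steps may have dependencies that make such a swap harder than shown here.

```python
from typing import Callable, Dict, List

Ticket = Dict[str, str]


def publish(
    ticket: Ticket,
    upload_files: Callable[[Ticket], None],
    create_event: Callable[[Ticket], None],
) -> None:
    """Upload first, so the event only becomes visible once its files exist."""
    # If the upload raises, create_event is never reached and no
    # half-finished entry can show up in /public/events/recent.
    upload_files(ticket)
    create_event(ticket)


# Tiny demonstration with stand-in callbacks that just record the call order.
calls: List[str] = []
publish(
    {"guid": "c78934cb-fd2b-4b7e-bc3a-0e530e8161f4"},
    upload_files=lambda ticket: calls.append("upload"),
    create_event=lambda ticket: calls.append("create"),
)
```

With this ordering, a failed or slow upload delays the event instead of exposing it with a length of 0 and broken file URLs.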

For reference, here is the current result of the route quoted above. The video can be found at https://media.ccc.de/v/dgwk2022-4163-demonight

{
    "guid": "c78934cb-fd2b-4b7e-bc3a-0e530e8161f4",
    "title": "Demonight",
    "subtitle": null,
    "slug": "dgwk2022-4163-demonight",
    "link": "https://cfp.winterkongress.ch/wk22/talk/8W7FN3/",
    "description": "Demonight von Echtzeit – Digitale Kultur\n\n",
    "original_language": "deu",
    "persons": [
        "Echtzeit – Digitale Kultur"
    ],
    "tags": [
        "import",
        "4163",
        "2022",
        "Andere"
    ],
    "view_count": 24,
    "promoted": false,
    "date": "2022-02-26T21:15:00.000+01:00",
    "release_date": "2022-03-21T00:00:00.000+01:00",
    "updated_at": "2022-03-22T20:15:06.980+01:00",
    "length": 7292,
    "duration": 7292,
    "thumb_url": "https://static.media.ccc.de/media/conferences/dgwk/2022/4163-c78934cb-fd2b-4b7e-bc3a-0e530e8161f4.jpg",
    "poster_url": "https://static.media.ccc.de/media/conferences/dgwk/2022/4163-c78934cb-fd2b-4b7e-bc3a-0e530e8161f4_preview.jpg",
    "timeline_url": "https://static.media.ccc.de/media/conferences/dgwk/2022/4163-c78934cb-fd2b-4b7e-bc3a-0e530e8161f4.timeline.jpg",
    "thumbnails_url": "https://static.media.ccc.de/media/conferences/dgwk/2022/4163-c78934cb-fd2b-4b7e-bc3a-0e530e8161f4.thumbnails.vtt",
    "frontend_link": "https://media.ccc.de/v/dgwk2022-4163-demonight",
    "url": "https://api.media.ccc.de/public/events/c78934cb-fd2b-4b7e-bc3a-0e530e8161f4",
    "conference_title": "Winterkongress 2022",
    "conference_url": "https://api.media.ccc.de/public/conferences/dgwk2022",
    "related": []
}