pruvisto/debirdify

Expand t.co links in spurious results

I think I found an example of this bug.

The plain text of the bio of Twitter user @brewster_kahle says: mastodon: @brewsterkahle@mastodon.archive.org

Twitter seems to encode that as mastodon: @brewsterkahle@mastodon.<a href="https://t.co/7ojy915Rea"><span>http://</span>archive.org</a>

Debirdify misinterprets the account as brewsterkahle@mastodon.https instead of @brewsterkahle@mastodon.archive.org
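One possible way to recover the intended handle from markup like this is sketched below. It is not Debirdify's actual code: the names `TextCollector`, `visible_text`, and `find_handles` are made up for illustration, and it assumes the bio is available as the HTML fragment quoted above. The idea is to concatenate the text nodes and then drop any `http://` or `https://` that directly follows a dot, on the assumption that such a scheme is residue from Twitter's link wrapping.

```python
import re
from html.parser import HTMLParser


class TextCollector(HTMLParser):
    """Collect all text nodes of an HTML fragment, ignoring the tags."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def visible_text(html):
    """Return the concatenated text content of an HTML fragment."""
    collector = TextCollector()
    collector.feed(html)
    collector.close()
    return "".join(collector.parts)


# Assumption: a URL scheme directly after a dot is link-wrapping residue,
# not part of the text the user typed.
EMBEDDED_SCHEME = re.compile(r"(?<=\.)https?://")

# Simplified pattern for handles of the form @user@host.tld
HANDLE = re.compile(r"@([A-Za-z0-9_]+)@([A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+)")


def find_handles(bio_html):
    """Extract Fediverse-style handles from a bio given as HTML."""
    text = EMBEDDED_SCHEME.sub("", visible_text(bio_html))
    return [f"@{user}@{host}" for user, host in HANDLE.findall(text)]


bio = (
    "mastodon: @brewsterkahle@mastodon."
    '<a href="https://t.co/7ojy915Rea"><span>http://</span>archive.org</a>'
)
print(find_handles(bio))  # ['@brewsterkahle@mastodon.archive.org']
```

The lookbehind keeps ordinary links such as `http://example.org` untouched, since their scheme does not follow a dot.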

I can confirm this. It causes some profiles to show up in the spurious results instead of being picked up properly.

For example, the HTML is:

<a dir="ltr" href="https://t.co/FQB2mjmfqn" rel="noopener noreferrer nofollow" target="_blank" role="link" class="css-4rbku5 css-18t94o4 css-901oao css-16my406 r-1cvl2hr r-1loqt21 r-poiln3 r-bcqeeo r-qvutc0">
  <span aria-hidden="true" class="css-901oao css-16my406 r-poiln3 r-hiw28u r-qvk6io r-bcqeeo r-qvutc0">http://</span>
  oisaur.com/@yoanmalie
</a>

Debirdify does not detect this profile properly.
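For a link like the one above, a rough sketch of how the displayed URL could be turned into a handle follows. Again, this is only an illustration and not how Debirdify is implemented: `LinkTextCollector`, `displayed_url`, and `handle_from_profile_url` are hypothetical names, the `class` attributes are omitted for brevity, and the host is copied as it appears in the snippet. It assumes that profile URLs have the form `scheme://host/@user`.

```python
from html.parser import HTMLParser
from urllib.parse import urlsplit


class LinkTextCollector(HTMLParser):
    """Collect the text nodes of a link, including the hidden scheme span."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        # Strip the layout whitespace (newlines, indentation) around each node.
        self.parts.append(data.strip())


def displayed_url(anchor_html):
    """Rebuild the URL that the link displays from its text nodes."""
    collector = LinkTextCollector()
    collector.feed(anchor_html)
    collector.close()
    return "".join(collector.parts)


def handle_from_profile_url(url):
    """Turn scheme://host/@user into @user@host, or return None."""
    parts = urlsplit(url)
    path = parts.path.strip("/")
    if not parts.hostname or len(path) < 2:
        return None
    if not path.startswith("@") or "/" in path:
        return None
    return f"{path}@{parts.hostname}"


anchor = """<a dir="ltr" href="https://t.co/FQB2mjmfqn" target="_blank" role="link">
  <span aria-hidden="true">http://</span>
  oisaur.com/@yoanmalie
</a>"""

url = displayed_url(anchor)
print(url)                           # http://oisaur.com/@yoanmalie
print(handle_from_profile_url(url))  # @yoanmalie@oisaur.com
```

Reading the displayed text avoids an extra network request to resolve the t.co redirect, at the cost of relying on Twitter's markup staying in this shape.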