this post was submitted on 31 Aug 2023

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ


Hello. I want to know how to write an extractor that gets the m3u8 file from an embedded video. I am trying to build a scraper to get anime episodes from websites like gogoanime or 9anime. The only thing I have been able to scrape so far is the embed link for each episode. Any idea how I can do it?

[–] smpl@discuss.tchncs.de 6 points 1 year ago

I build a lot of tools like that, and the first thing I do is open the developer tools in my browser and watch the network traffic. When you find the resource you're after, scroll back and see which requests led to that URL. From those requests, work out which parameters in the original static HTML document and its resources are used to construct the URL. That might require reverse-engineering some JavaScript, but that's rare. After that you'll have a pretty good idea of how to get from the original URL to the video resource. Watch out for cookies set by the responses; they might be needed for the later requests. For building my tools I use Perl, or sometimes just Bash or a Greasemonkey userscript, to fetch and parse the URLs and construct the desired output.
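To make the "parse the static document" step concrete, here is a minimal Python sketch (not the Perl/Bash tooling mentioned above). It assumes the playlist URL appears as a quoted string somewhere in the page or its inline scripts, which is not always the case. The function name `find_m3u8_urls` and the example hosts are made up for illustration.

```python
import re
from urllib.parse import urljoin

# Quoted strings that end in .m3u8, optionally followed by a query string.
M3U8_RE = re.compile(r"""["']([^"'\s]+?\.m3u8(?:\?[^"'\s]*)?)["']""")


def find_m3u8_urls(page_text, base_url):
    """Return the distinct .m3u8 URLs found in page_text, in order of appearance.

    Hypothetical helper: page_text is the HTML (or JavaScript) you already
    fetched, base_url is the address it came from.
    """
    # Inline JSON often escapes slashes as "\/", so undo that first.
    unescaped = page_text.replace("\\/", "/")
    found = []
    for match in M3U8_RE.finditer(unescaped):
        # Resolve relative paths against the page they were found on.
        url = urljoin(base_url, match.group(1))
        if url not in found:
            found.append(url)
    return found


if __name__ == "__main__":
    # Made-up embed page content, only for demonstration.
    sample = (
        '<script>var player = {"file":"https:\\/\\/cdn.example.com\\/v\\/master.m3u8?token=abc"};</script>'
        "<source src='/hls/ep1.m3u8'>"
    )
    for url in find_m3u8_urls(sample, "https://embed.example.com/e/123"):
        print(url)
```

If the URL is only assembled by JavaScript at runtime, a pattern match like this finds nothing, and you are back to the network tab to see which request produces it. Fetching the page itself is left out on purpose: use whatever HTTP client you like, and carry the cookies along between requests.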

[–] backhdlp@lemmy.blahaj.zone 1 points 1 year ago (1 children)

Try to learn from the source code of similar projects.

[–] Nemila@lemmy.dbzer0.com 0 points 1 year ago (2 children)
[–] dM9sxNPexK3bwZS@lemmy.sdfeu.org 2 points 1 year ago* (last edited 1 year ago)

YT-DLP has a list of extractors for lots of different sites. You could see if there are similar sites and see how their extractors work.

https://github.com/yt-dlp/yt-dlp/tree/master/yt_dlp/extractor

Or Animescraper has extractors for them, but it was last updated 6 years ago.

https://github.com/jQwotos/animeDownloaders/tree/master

https://github.com/jQwotos/anime_scrapers/tree/63b415fcaaa685f03b54fe6ee294c13178736637/scrapers

Maybe movie-web