Explore projects
-
-
Code Challenge for ofcourseme hiring process, scraper based on Scrapy for the Federica MOOC platform. Scraped data are stored in a Django database and exposed through a webpanel.
Archived 0Updated -
Crawls the entire natureasia website and produce sitemap XML files from it. Uses Scrapy open-source Python module.
Updated -
This tool was designed to extract member institution locations from the official North American Reciprocal Association (NARM) website and places them into KML format. There are a few Google maps versions of this data created by individuals out there (like this one) that are probably mostly accurate, but the reciprocal agreements evidently update at least annually, which means they have a higher potential for inaccuracy every year they are not updated. Hopefully that dynamism within the NARM locations makes this script a particularly useful one, as (in theory) until NARM makes a major update to their website source code, this scraper will always produce a kml file with the most updated map at any time of any year.
Updated -
Made to scrape known moons from Wikipedia for lastmiles streams (https://twitch.tv/lastmiles)
Updated -
modulargrid-based scraper of some eurorack resellers
Updated -
-
August Sandoval / Manga Desktop Reader
MIT LicenseA simple manga desktop reader written in python GTK and Tkinter
UpdatedUpdated -
-
Igor Benek-Lins / Jobs Afunnilator
MIT LicenseSimple scraper of job websites with clustering visualisation options
Updated -
JCU Menza menu site scraper. List menus like a human. https://hajnyon.gitlab.io/jcu-menza-scraper/
Updated -
-
TUVIMEN / invision-scraper
GNU General Public License v3.0 or laterA bash script for scraping invision forums in json
Updated -
fe / InstagramScraper
GNU General Public License v3.0 onlyIncrementally download Instagram posts, stories, collections and profilepictures from a set of given accounts
Updated -
explore and download movies much EASIER
Updated -
Todos los ISBN de Indautor en formato JSON.
Archived 0Updated -
hydrargyrum / img-lurker
Do What The F*ck You Want To Public LicenseScrape a web page and downloads images (even if they are on other linked pages)
Updated -
TUVIMEN / imdb-scraper
GNU General Public License v3.0 or laterA very simple shell script for scraping imdb in json
Updated -
Simple python script to scrap all the files from a thread. Should work with most imageboards.
Updated -
marzzzello / gplaycrawler
MIT LicenseDiscover apps by different mehtods. Mass download app packages and metadata.
Updated