mirror of
https://github.com/jlengrand/OpenGraphKt.git
synced 2026-03-10 08:31:23 +00:00
974 B
974 B
#Scrape test module
The scrape test module is intended to test the immplementation of the library at scale by parsing a large amount of webpages and checking the quality of its results
Data
At this moment
I'd like a more varied set of data from different types of sources, and the current set mostly seem to contain homepages but it's surprisingly hard to find.
Running the tests
For various reasons, I am not uploading the actual data of the various URLs. To run the analysis yourself:
- Run
Scraper.ktonce, which will grab all the webpages and place them in thedata/webfolder. - Run
ParserTest.kt, which will run theParseron each of those web pages and check whether the tags can be extracted, and if the page is considered valid.