Scrapers always check for a local copy of the target resource (using Scraper.checkForLocalRecord) before executing a scrape from an external resource. If the resource was found (and therefore no external calls made), this is set to true.
Id of the local database record associated with this page scrape
A simple, human-readble description of what is being scraped. Used for logging.
used for caching failed results, to blacklist further calls
TypeORM repository handling all data flow in/out of database table
Contains all results generated by Scraper.scrape, including recursive calls.
Stores the DOM retrieved by scraperapi
Flag indicating a sucessful scrape, set to true after non-error-throwing call to Scraper.scrape.
External url indicating the scraper's target resource.
Used to override .env settings and force-log the output of a given scraper.
Extracts and stores:
Extracts and stores (from example raw element text: Male / 33
)
Simple CLI reporting tool for debugging unsuccessful scrapes
Queries scraperapi for ScraperApiScraper.url
Required .env
variables:
SCRAPER_API_KEY
: scraperapi dashboardSCRAPER_API_REQUEST_ATTEMPTS
: Times a request is allowed to fail before error
is thrownTracks the number of times a given request has failed, used to track recurring calls to this function. Should never be set if called externally.
Entry point for initiating an asset scrape. General scrape outline/method order:
If set to true, scrapes the external resource regardless of any existing local records
Scrape the user's favorite artists
When a scrape fails, add it to the blacklist, then throw the error
Scrape the genres associated with this artist
Generated using TypeDoc
Manages the scraping and storage of a profile from Rate Your Music.
For more information on class properties, see corresponding props in ProfileEntity.