AlbumScraper | MuCritic

Manages the scraping and storage of an album from Rate Your Music.

For more information on class properties, see corresponding props in AlbumEntity.

Hierarchy

RymScraper<AlbumEntity>
- AlbumScraper

Index

Constructors

constructor

Methods

Constructors

constructor

new AlbumScraper(url: string, verbose?: boolean): AlbumScraper

Overrides ScraperApiScraper.constructor
- Defined in scrapers/rym/albumScraper.ts:39
Parameters
- url: string
  
  Example: https://rateyourmusic.com/release/album/aphex-twin/_i-care-because-you-do/
- Default value verbose: boolean = false
Returns AlbumScraper

Properties

artist

artist: ArtistScraper

dataReadFromLocal

dataReadFromLocal: boolean

Scrapers always check for a local copy of the target resource (using Scraper.checkForLocalRecord) before executing a scrape from an external resource. If the resource was found (and therefore no external calls made), this is set to true.

databaseId

databaseId: number

Id of the local database record associated with this page scrape

description

description: string

A simple, human-readble description of what is being scraped. Used for logging.

genreScrapers

genreScrapers: GenreScraper[]

issueCountRYM

issueCountRYM: number

listCountRYM

listCountRYM: number

name

name: string

overallRankRYM

overallRankRYM: number

ratingCountRYM

ratingCountRYM: number

ratingRYM

ratingRYM: number

redis

redis: RedisHelper

used for caching failed results, to blacklist further calls

Protected repository

repository: Repository<AlbumEntity>

TypeORM repository handling all data flow in/out of database table

results

results: ResultBatch

Contains all results generated by Scraper.scrape, including recursive calls.

reviewCountRYM

reviewCountRYM: number

Protected scrapeRoot

scrapeRoot: ParseElement

Stores the DOM retrieved by scraperapi

scrapeSucceeded

scrapeSucceeded: boolean

Flag indicating a sucessful scrape, set to true after non-error-throwing call to Scraper.scrape.

url

url: string

External url indicating the scraper's target resource.

verbose

verbose: boolean

Used to override .env settings and force-log the output of a given scraper.

yearRankRYM

yearRankRYM: number

Methods

checkForLocalRecord

checkForLocalRecord(): Promise<boolean>

Inherited from RymScraper.checkForLocalRecord

Overrides Scraper.checkForLocalRecord
- Defined in scrapers/rym/rymScraper.ts:23
Returns Promise<boolean>

Private extractCountInfo

extractCountInfo(): void

- Defined in scrapers/rym/albumScraper.ts:67
Extracts and stores three similarly laid out elements, parsed by extractCountFromPair:
Returns void

Protected extractInfo

extractInfo(): void

Overrides Scraper.extractInfo
- Defined in scrapers/rym/albumScraper.ts:92
Returns void

Private extractMainInfoBlocks

extractMainInfoBlocks(): void

- Defined in scrapers/rym/albumScraper.ts:111
The main information on Album pages is represented by a series of elements for which order and quantity are both indeterminate. This method loops through them, storing info based on their header text.

Extracts and stores:
- AlbumScraper.artist
- AlbumScraper.ratingRYM
- AlbumScraper.ratingCountRYM
- AlbumScraper.yearRankRYM
- AlbumScraper.overallRankRYM
- [[AlbumScraper.genreScrapersRYM]] (uses GenreScraper.createScrapers)
Returns void

Private extractName

extractName(): void

- Defined in scrapers/rym/albumScraper.ts:168
Extracts and stores
- AlbumScraper.name
Returns void

getEntity

getEntity(): Promise<AlbumEntity>

Overrides RymScraper.getEntity
- Defined in scrapers/rym/albumScraper.ts:176
Returns Promise<AlbumEntity>

printInfo

printInfo(): void

Overrides Scraper.printInfo
- Defined in scrapers/rym/albumScraper.ts:180
Returns void

printResult

printResult(): void

Inherited from Scraper.printResult
- Defined in scrapers/scraper.ts:96
Simple CLI reporting tool for debugging unsuccessful scrapes

Returns void

requestScrape

requestScrape(attempts?: number): Promise<void>

Inherited from ScraperApiScraper.requestScrape

Overrides Scraper.requestScrape
- Defined in scrapers/scraperApiScraper.ts:49
Queries scraperapi for ScraperApiScraper.url

Required .env variables:
- SCRAPER_API_KEY: scraperapi dashboard
- SCRAPER_API_REQUEST_ATTEMPTS: Times a request is allowed to fail before error is thrown
Parameters
- Default value attempts: number = 0
  
  Tracks the number of times a given request has failed, used to track recurring calls to this function. Should never be set if called externally.
Returns Promise<void>

Protected saveToLocal

saveToLocal(): Promise<void>

Overrides RymScraper.saveToLocal
- Defined in scrapers/rym/albumScraper.ts:195
Returns Promise<void>

scrape

scrape(forceScrape?: boolean): Promise<void>

Inherited from Scraper.scrape
- Defined in scrapers/scraper.ts:163
Entry point for initiating an asset scrape. General scrape outline/method order:
1. Scraper.checkForLocalRecord
2. If local entity was found, update class props and return.
3. Scraper.requestScrape
4. Scraper.extractInfo
5. Scraper.scrapeDependencies
6. Scraper.saveToLocal
7. Update class props and return
remarks

This method should be considered unsafe - there are several points where this can throw errors. This is intentional, and allows easier support for relational data scraping/storage. Scraped assets may have a mixture of required and non-required dependencies, the status of which should be kept in mind when implementing Scraper.scrapeDependencies. A subclass should catch and log errors from non-required scrapes. However, errors from a required scrape should remain uncaught, so the original call to a Scraper.scrape will error out before [[Scraper.save]] is called for incomplete data.
Parameters
- Default value forceScrape: boolean = false
  
  If set to true, scrapes the external resource regardless of any existing local records
Returns Promise<void>

Protected scrapeDependencies

scrapeDependencies(): Promise<void>

Overrides Scraper.scrapeDependencies
- Defined in scrapers/rym/albumScraper.ts:227
Scrape the artist and genres associated with this album

Returns Promise<void>

Protected scrapeErrorHandler

scrapeErrorHandler(error: Error): Promise<void>

Inherited from ScraperApiScraper.scrapeErrorHandler

Overrides Scraper.scrapeErrorHandler
- Defined in scrapers/scraperApiScraper.ts:78
When a scrape fails, add it to the blacklist, then throw the error

Parameters
- error: Error
Returns Promise<void>

Static scrapeDependencyArr

scrapeDependencyArr<T>(scrapers: T[], forceScrape?: boolean): Promise<ScrapersWithResults<T>>

Inherited from Scraper.scrapeDependencyArr
- Defined in scrapers/scraper.ts:204
Scrape the genres associated with this artist

Type parameters
- T: Scraper
Parameters
- scrapers: T[]
- Default value forceScrape: boolean = false
Returns Promise<ScrapersWithResults<T>>

Hierarchy

Index

Constructors

Properties

Methods

Constructors

constructor

Parameters

url: string

Default value verbose: boolean = false

Returns AlbumScraper

Properties

artist

dataReadFromLocal

databaseId

description

genreScrapers

issueCountRYM

listCountRYM

name

overallRankRYM

ratingCountRYM

ratingRYM

redis

Protected repository

results

reviewCountRYM

Protected scrapeRoot

scrapeSucceeded

url

verbose

yearRankRYM

Methods

checkForLocalRecord

Returns Promise<boolean>

Private extractCountInfo

Returns void

Protected extractInfo

Returns void

Private extractMainInfoBlocks

Returns void

Private extractName

Returns void

getEntity

Returns Promise<AlbumEntity>

printInfo

Returns void

printResult

Returns void

requestScrape

Parameters

Default value attempts: number = 0

Returns Promise<void>

Protected saveToLocal

Returns Promise<void>

scrape

Parameters

Default value forceScrape: boolean = false

Returns Promise<void>

Protected scrapeDependencies

Returns Promise<void>

Protected scrapeErrorHandler

Parameters

error: Error

Returns Promise<void>

Static scrapeDependencyArr

Type parameters

T: Scraper

Parameters

scrapers: T[]

Default value forceScrape: boolean = false

Returns Promise<ScrapersWithResults<T>>