Regex used for excluding languages from the import.
Simple regex matching Wikipedia language codes. Language codes have at least two characters, start with a lower-case letter, and otherwise contain only lower-case letters and dashes. However, there are also dumps with names such as "wikimania2005wiki", which contain digits.
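The rule described above can be illustrated with a minimal sketch. The pattern and the names `LanguageCodeSketch`, `LanguageCode`, and `isLanguageCode` are assumptions made for illustration, not the definitions used in the code. Digits are accepted here so that dump names like "wikimania2005wiki" also match.

```scala
object LanguageCodeSketch {
  // Hypothetical pattern: a lower-case first letter followed by at least one
  // more lower-case letter, digit, or dash (so two characters minimum).
  val LanguageCode = """([a-z][a-z0-9-]+)""".r

  // A Scala Regex used as a pattern in `match` must match the whole string.
  def isLanguageCode(s: String): Boolean = s match {
    case LanguageCode(_) => true
    case _               => false
  }
}
```

With this pattern, "en", "zh-min-nan", and "wikimania2005wiki" are accepted, while "a", "EN", and "2005" are rejected.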
Regex for a numeric range; both limits are optional.
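A range with optional limits might be parsed as in the following sketch. The pattern, the names `RangeSketch` and `parseRange`, and the defaults used for a missing limit (0 and `Int.MaxValue`) are assumptions for illustration only.

```scala
object RangeSketch {
  // Hypothetical pattern: optional digits, a dash, optional digits.
  val Range = """(\d*)-(\d*)""".r

  // Returns Some((lower, upper)) for a range, None for anything else.
  // A missing lower limit becomes 0, a missing upper limit Int.MaxValue.
  // Note: limits that do not fit into an Int would make toInt throw.
  def parseRange(s: String): Option[(Int, Int)] = s match {
    case Range(from, to) =>
      val lower = if (from.isEmpty) 0 else from.toInt
      val upper = if (to.isEmpty) Int.MaxValue else to.toInt
      Some((lower, upper))
    case _ => None
  }
}
```

Under these assumptions, "10000-" means "at least 10000", "-500" means "at most 500", and a language code such as "en" is not a range.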
This function was extracted from the ImageExtractor object because the free and nonfree images are now extracted before the extraction jobs start.
pages_articles of a given language
the wikicode of a given language
two lists: ._1: list of free images, ._2: list of nonfree images
directory of wikipedia.csv, needed to resolve article count ranges
array of space- or comma-separated language codes or article count ranges
languages, sorted by language code
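The first step of handling such an argument array, splitting it and separating language codes from article count ranges, could look roughly like the sketch below. The object `ParseLanguagesSketch`, the method `split`, and both patterns are hypothetical. Resolving the ranges against wikipedia.csv is deliberately left out, since the file's layout is not described here.

```scala
object ParseLanguagesSketch {
  // Hypothetical patterns; the two cannot match the same token, because a
  // code starts with a letter and a range contains only digits and a dash.
  private val Code  = """([a-z][a-z0-9-]+)""".r
  private val Range = """(\d*)-(\d*)""".r

  // Splits every argument on commas and whitespace, then returns
  // (language codes sorted alphabetically, range tokens in input order).
  def split(args: Array[String]): (List[String], List[String]) = {
    val tokens = args.toList.flatMap(_.split("[,\\s]+")).filter(_.nonEmpty)
    val codes  = tokens.collect { case Code(c) => c }.distinct.sorted
    val ranges = tokens.collect { case r @ Range(_, _) => r }
    (codes, ranges)
  }
}
```

For example, `ParseLanguagesSketch.split(Array("en,de", "10000- fr"))` separates the codes "de", "en", and "fr" from the range "10000-".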