org.dbpedia.extraction.util

Finder

class Finder[T] extends AnyRef

Helps to find files and directories in a directory structure as used by the Wikipedia dump download site, for example baseDir/enwiki/20120403/enwiki-20120403-pages-articles.xml.bz2

TODO: wikiNameSuffix doesn't belong here, it should be part of the Language class (which should be renamed to WikiCode or so)
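
As a rough illustration of the layout described above, the following standalone Scala sketch composes the same paths with plain strings. It does not use Finder itself: the object DumpLayoutSketch and its method names are hypothetical, and forming the wiki name by simple concatenation of a language code and wikiNameSuffix is an assumption based on the "en" -> "enwiki" example.

```scala
// Hypothetical helpers that mirror the documented directory layout.
// None of these names belong to the Finder class.
object DumpLayoutSketch {
  // Directory name / file prefix, e.g. ("en", "wiki") -> "enwiki".
  // Assumption: plain concatenation of language code and suffix.
  def wikiName(languageCode: String, wikiNameSuffix: String): String =
    languageCode + wikiNameSuffix

  // Directory for a language, e.g. "baseDir/enwiki".
  def wikiDir(baseDir: String, wiki: String): String =
    s"${baseDir}/${wiki}"

  // Date directory, e.g. "baseDir/enwiki/20120403".
  def dateDir(baseDir: String, wiki: String, date: String): String =
    s"${wikiDir(baseDir, wiki)}/${date}"

  // File in a date directory, e.g.
  // "baseDir/enwiki/20120403/enwiki-20120403-pages-articles.xml.bz2".
  def datedFile(baseDir: String, wiki: String, date: String, suffix: String): String =
    s"${dateDir(baseDir, wiki, date)}/${wiki}-${date}-${suffix}"

  // File in the main language directory, e.g.
  // "baseDir/enwiki/enwiki-download-running".
  def wikiFile(baseDir: String, wiki: String, suffix: String): String =
    s"${wikiDir(baseDir, wiki)}/${wiki}-${suffix}"
}
```

For example, DumpLayoutSketch.datedFile("baseDir", DumpLayoutSketch.wikiName("en", "wiki"), "20120403", "pages-articles.xml.bz2") yields the path quoted in the class description.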

Linear Supertypes
AnyRef, Any

Instance Constructors

  1. new Finder(baseDir: String, language: Language, wikiNameSuffix: String)(implicit parse: (String) ⇒ T, wrap: (T) ⇒ FileLike[T])

  2. new Finder(baseDir: T, language: Language, wikiNameSuffix: String)(implicit wrap: (T) ⇒ FileLike[T])

Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  5. val baseDir: T

  6. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  7. def dates(suffix: String = null, required: Boolean = true, isSuffixRegex: Boolean = false): List[String]

    Finds the names (which are dates in format YYYYMMDD) of dump directories for the language.

    suffix: Return only directories that contain a file with this suffix, e.g. "download-complete" -> "baseDir/enwiki/20120403/enwiki-20120403-download-complete". May be null, in which case only date directories are looked for.

    returns: dates in ascending order

  8. def directory(date: String): T

    Date directory for language, e.g. "20120403" -> "baseDir/enwiki/20120403"

  9. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  10. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  11. def file(date: String, suffix: String): Option[T]

    File with given name suffix in date directory for language, e.g. "pages-articles.xml" -> "baseDir/enwiki/20120403/enwiki-20120403-pages-articles.xml"

  12. def file(suffix: String): Option[T]

    File with given name suffix in main directory for language, e.g. "download-running" -> "baseDir/enwiki/enwiki-download-running"

  13. def files(suffix: String, required: Boolean = true): List[T]

    returns: files in ascending date order

  14. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  15. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  16. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  17. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  18. val language: Language

  19. def matchFiles(date: String, pattern: String): List[T]

    Files which match the supplied pattern in date directory for language. Files are sorted by size (descending).

  20. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  21. final def notify(): Unit

    Definition Classes
    AnyRef
  22. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  23. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  24. def toString(): String

    Definition Classes
    AnyRef → Any
  25. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  26. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  27. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  28. val wikiDir: T

    Directory for language, e.g. "baseDir/enwiki"

  29. val wikiName: String

    Directory name/file prefix for language, e.g. "en" -> "enwiki"

  30. val wikiNameSuffix: String
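
The behaviour documented for dates and matchFiles can be approximated with java.io.File, as in the sketch below. This is not the Finder implementation: the object FinderSketch is hypothetical, suffix is modelled as an Option instead of a nullable String, the required and isSuffixRegex parameters are left out, and treating pattern as a regular expression that must match the whole file name is an assumption.

```scala
import java.io.File

// Hypothetical stand-in for two Finder members, written against java.io.File.
object FinderSketch {
  // Children of a directory; listFiles returns null if dir is missing
  // or is not a directory, so wrap the result in Option.
  private def children(dir: File): List[File] =
    Option(dir.listFiles()).map(_.toList).getOrElse(Nil)

  // Names of YYYYMMDD subdirectories of wikiDir, in ascending order.
  // If suffix is defined, keep only dates whose directory contains
  // a file named "<wikiName>-<date>-<suffix>".
  def dates(wikiDir: File, wikiName: String, suffix: Option[String] = None): List[String] =
    children(wikiDir)
      .filter(_.isDirectory)
      .map(_.getName)
      .filter(_.matches("""\d{8}"""))
      .filter { date =>
        suffix.forall { s =>
          new File(new File(wikiDir, date), s"${wikiName}-${date}-${s}").exists()
        }
      }
      .sorted // lexicographic order equals date order for YYYYMMDD

  // Regular files in dateDir whose name matches the regex, largest first.
  def matchFiles(dateDir: File, pattern: String): List[File] =
    children(dateDir)
      .filter(f => f.isFile && f.getName.matches(pattern))
      .sortBy(f => -f.length())
}
```

Both methods return an empty list when the directory does not exist, which is a choice made for this sketch only.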
