Corpora

Index

_ | A | C | G | M | S | T

_

__getitem__() (corpora.Corpus method)
__iter__() (corpora.Corpus method)
__len__() (corpora.Corpus method)

A

add() (corpora.Corpus method)

C

corpora.Corpus.ExceptionDuplicate
corpora.Corpus.ExceptionTooBig
Corpus (class in corpora)
Corpus.ExceptionDuplicate
Corpus.ExceptionTooBig
create() (corpora.Corpus static method)

G

get() (corpora.Corpus method)
get_by_idx() (corpora.Corpus method)
get_chunk() (corpora.Corpus method)
get_idx() (corpora.Corpus method)
get_ridx() (corpora.Corpus method)

M

make_new_chunk() (corpora.Corpus method)

S

save_config() (corpora.Corpus method)
save_indexes() (corpora.Corpus method)

T

test_chunk_size() (corpora.Corpus method)

© Copyright 2011, Krzysztof Dorosz.
