Corpora
Index
_ | A | C | G | M | S | T
_
__getitem__() (corpora.Corpus method)
__iter__() (corpora.Corpus method)
__len__() (corpora.Corpus method)
A
add() (corpora.Corpus method), [1]
C
corpora.Corpus.ExceptionDuplicate
corpora.Corpus.ExceptionTooBig
Corpus (class in corpora)
Corpus.ExceptionDuplicate
Corpus.ExceptionTooBig
create() (corpora.Corpus static method), [1]
G
get() (corpora.Corpus method), [1]
get_by_idx() (corpora.Corpus method)
get_chunk() (corpora.Corpus method)
get_idx() (corpora.Corpus method)
get_ridx() (corpora.Corpus method)
M
make_new_chunk() (corpora.Corpus method)
S
save_config() (corpora.Corpus method)
save_indexes() (corpora.Corpus method), [1]
T
test_chunk_size() (corpora.Corpus method)