Corpora
Motivation for corpora system
Corpora API
Creating new corpus
Appending document to corpus
Sequential access to corpus
Random access to corpus
Size of corpus
Exceptions
Corpus internal format
File config
File chunkN
File idx
File ridx
Benchmarks for corpora system
Corpus size overhead
Document append performance
Credits
Author
Source code and contribution
Index
_ | A | C | G | M | S | T
_
__getitem__() (corpora.Corpus method)
__iter__() (corpora.Corpus method)
__len__() (corpora.Corpus method)
A
add() (corpora.Corpus method)
C
corpora.Corpus.ExceptionDuplicate
corpora.Corpus.ExceptionTooBig
Corpus (class in corpora)
Corpus.ExceptionDuplicate
Corpus.ExceptionTooBig
create() (corpora.Corpus static method)
G
get() (corpora.Corpus method)
get_by_idx() (corpora.Corpus method)
get_chunk() (corpora.Corpus method)
get_idx() (corpora.Corpus method)
get_ridx() (corpora.Corpus method)
M
make_new_chunk() (corpora.Corpus method)
S
save_config() (corpora.Corpus method)
save_indexes() (corpora.Corpus method)
T
test_chunk_size() (corpora.Corpus method)