clic.migrate.corpora_repo: Import books from the CCR corpora repository
-
clic.migrate.corpora_repo.export_book(book, dir='.', write_regions=False)
Export book object (book) to “(dir)/(book[‘name’]).txt”.
If write_regions is true, also write a book.regions.csv next to the book, containing all regions.
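The file layout described above can be illustrated with a minimal sketch. This is not the clic implementation: `export_book_sketch` is a hypothetical stand-in, and the shape of the book object (a dict with `name`, `content` and `regions` keys) and the exact region file name (`<name>.regions.csv`) are assumptions made only for illustration.

```python
import csv
import os
import tempfile


def export_book_sketch(book, dir='.', write_regions=False):
    """Hypothetical stand-in for export_book; NOT the clic implementation."""
    # Assumption: book is a dict with 'name', 'content' and optional 'regions'.
    txt_path = os.path.join(dir, book['name'] + '.txt')
    with open(txt_path, 'w', encoding='utf-8') as f:
        f.write(book['content'])
    written = [txt_path]

    if write_regions:
        # Assumption: the region file is named "<name>.regions.csv"
        # and sits in the same directory as the text file.
        csv_path = os.path.join(dir, book['name'] + '.regions.csv')
        with open(csv_path, 'w', encoding='utf-8', newline='') as f:
            csv.writer(f).writerows(book.get('regions', []))
        written.append(csv_path)
    return written


with tempfile.TemporaryDirectory() as tmp:
    demo = {
        'name': 'demo',
        'content': 'Chapter 1\nIt was a dark night.\n',
        # Assumed region rows: (region type, start offset, end offset)
        'regions': [('chapter.title', 0, 9)],
    }
    paths = export_book_sketch(demo, dir=tmp, write_regions=True)
    names = [os.path.basename(p) for p in paths]
    print(names)
```

With `write_regions=False` only the `.txt` file would be written in this sketch.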
-
clic.migrate.corpora_repo.get_corpora_for(book_paths)
Return all corpora objects that contain the books in book_paths.
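As a rough illustration of the lookup this describes, the sketch below filters a list of corpora by book name. Everything in it is an assumption: the real function takes only `book_paths`, whereas the hypothetical `get_corpora_for_sketch` is handed the corpora explicitly, and the `id` and `contents` keys are invented for the example.

```python
import os


def get_corpora_for_sketch(corpora, book_paths):
    """Hypothetical illustration; NOT the clic implementation."""
    # Assumption: a book is identified by its file name without extension.
    wanted = {os.path.splitext(os.path.basename(p))[0] for p in book_paths}
    # Keep every corpus that lists at least one of the wanted books.
    return [c for c in corpora if wanted & set(c['contents'])]


# Assumed structure: each corpus is a dict with an 'id' and a list of book names.
corpora = [
    {'id': 'corpus_a', 'contents': ['book_one', 'book_two']},
    {'id': 'corpus_b', 'contents': ['book_three']},
]
matches = get_corpora_for_sketch(corpora, ['corpus_a/book_two.txt'])
print([c['id'] for c in matches])
```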
-
clic.migrate.corpora_repo.import_book(book_path)
Read the book at (book_path).
-
clic.migrate.corpora_repo.parse_corpora_bib(corpora_dir, bib_name='corpora.bib')
Read in a corpora.bib file and output a list of corpora dicts.
-
clic.migrate.corpora_repo.script_import_corpora_repo()
Import corpora book file(s) into the DB.
-
clic.migrate.corpora_repo.script_region_export()
Calculate regions and re-export them as CSV.
-
clic.migrate.corpora_repo.to_region_file(book_path)
Return the path of the relevant region file for book_path.
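A minimal sketch of such a path mapping follows. It assumes the region file sits next to the book and swaps the book's extension for `.regions.csv`; the actual naming rule used by clic may differ, and `to_region_file_sketch` is a hypothetical name.

```python
import os


def to_region_file_sketch(book_path):
    """Hypothetical illustration; NOT the clic implementation."""
    # Assumption: "some_book.txt" maps to "some_book.regions.csv"
    # in the same directory.
    base, _ext = os.path.splitext(book_path)
    return base + '.regions.csv'


region_path = to_region_file_sketch(os.path.join('some_corpus', 'some_book.txt'))
print(region_path)
```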