@@ -6,11 +6,15 @@ _CorpusTools_ is a set of tools to administrate Giellatekno's corpora.
 
 A few examples:
 
-- [add_files_to_corpus](scripts/add_files_to_corpus) - Add raw source material files to a corpus
-- [convert2xml](scripts/convert2xml) - Converts original files to the Giellatekno-internal xml format.
-- [analyse_corpus](scripts/analyse_corpus) - Orchistrates the _hfst_ (etc) tools to run analysis on a corpus.
-- [ccat](scripts/ccat) - Output text sections from an analysed or non-analysed corpus.
-- [korp_mono](scripts/korp_mono) - Convert analysed files to korp-input
+- [add_files_to_corpus](scripts/add_files_to_corpus.md) - Add raw source
+material files to a corpus
+- [convert2xml](scripts/convert2xml.md) - Converts original files to the
+Giellatekno-internal xml format.
+- [analyse_corpus](scripts/analyse_corpus.md) - Orchistrates the _hfst_ (etc)
+tools to run analysis on a corpus.
+- [ccat](scripts/ccat.md) - Output text sections from an analysed or
+non-analysed corpus.
+- [korp_mono](scripts/korp_mono.md) - Convert analysed files to korp-input
 
 ## Installation from apertium nightly
 
@@ -21,20 +25,22 @@ will find it.
 
 ## Installation using pipx
 
-> pipx lets you install python packages that has runnable scripts easily,
-> onto your system.
+> pipx lets you install python packages that has runnable scripts easily, onto
+> your system.
 
 1. Install [pipx](https://pypa.github.io/pipx/installation/)
 1. Run `pipx install --force git+https://github.com/divvun/CorpusTools.git`
 
 ### Editable install (alternate pipx installation method)
 
 An _editable_ install lets you make changes in the source script files, and
-still use the same global command on the command line to run the (modified) scripts.
-Recommended if you intend to do development on the scripts.
+still use the same global command on the command line to run the (modified)
+scripts. Recommended if you intend to do development on the scripts.
 
-1. Clone the CorpusTools repository: (`git clone https://github.com/divvun/CorpusTools.git CorpusTools`)
-1. Install with the editable flag (`-e`): `pipx install -e --force /path/to/CorpusTools`
+1. Clone the CorpusTools repository:
+(`git clone https://github.com/divvun/CorpusTools.git CorpusTools`)
+1. Install with the editable flag (`-e`):
+`pipx install -e --force /path/to/CorpusTools`
 
 ### Requirements
 
@@ -43,14 +49,12 @@ Recommended if you intend to do development on the scripts.
 - pandoc (only needed for convert2xml?)
 
 Installation commands
-=== "Mac"
+===================== "Mac"
 
-```
-sudo port install latex2html poppler pandoc
-```
+``` sudo port install latex2html poppler pandoc ```
 
-=== "Debian/Ubuntu"
+"Debian/Ubuntu"
 `sudo apt-get install poppler-utils pandoc`
 
-=== "Arch Linux"
+"Arch Linux"
 `sudo pacman -S pandoc`