Skip to content

Commit c1fcc12

Browse files
committed
Documentation clean up
1 parent 13d1876 commit c1fcc12

4 files changed

Lines changed: 30 additions & 29 deletions

File tree

docs/docs/index.md

Lines changed: 21 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -6,11 +6,15 @@ _CorpusTools_ is a set of tools to administrate Giellatekno's corpora.
66

77
A few examples:
88

9-
- [add_files_to_corpus](scripts/add_files_to_corpus) - Add raw source material files to a corpus
10-
- [convert2xml](scripts/convert2xml) - Converts original files to the Giellatekno-internal xml format.
11-
- [analyse_corpus](scripts/analyse_corpus) - Orchistrates the _hfst_ (etc) tools to run analysis on a corpus.
12-
- [ccat](scripts/ccat) - Output text sections from an analysed or non-analysed corpus.
13-
- [korp_mono](scripts/korp_mono) - Convert analysed files to korp-input
9+
- [add_files_to_corpus](scripts/add_files_to_corpus.md) - Add raw source
10+
material files to a corpus
11+
- [convert2xml](scripts/convert2xml.md) - Converts original files to the
12+
Giellatekno-internal xml format.
13+
- [analyse_corpus](scripts/analyse_corpus.md) - Orchistrates the _hfst_ (etc)
14+
tools to run analysis on a corpus.
15+
- [ccat](scripts/ccat.md) - Output text sections from an analysed or
16+
non-analysed corpus.
17+
- [korp_mono](scripts/korp_mono.md) - Convert analysed files to korp-input
1418

1519
## Installation from apertium nightly
1620

@@ -21,20 +25,22 @@ will find it.
2125

2226
## Installation using pipx
2327

24-
> pipx lets you install python packages that has runnable scripts easily,
25-
> onto your system.
28+
> pipx lets you install python packages that has runnable scripts easily, onto
29+
> your system.
2630
2731
1. Install [pipx](https://pypa.github.io/pipx/installation/)
2832
1. Run `pipx install --force git+https://github.com/divvun/CorpusTools.git`
2933

3034
### Editable install (alternate pipx installation method)
3135

3236
An _editable_ install lets you make changes in the source script files, and
33-
still use the same global command on the command line to run the (modified) scripts.
34-
Recommended if you intend to do development on the scripts.
37+
still use the same global command on the command line to run the (modified)
38+
scripts. Recommended if you intend to do development on the scripts.
3539

36-
1. Clone the CorpusTools repository: (`git clone https://github.com/divvun/CorpusTools.git CorpusTools`)
37-
1. Install with the editable flag (`-e`): `pipx install -e --force /path/to/CorpusTools`
40+
1. Clone the CorpusTools repository:
41+
(`git clone https://github.com/divvun/CorpusTools.git CorpusTools`)
42+
1. Install with the editable flag (`-e`):
43+
`pipx install -e --force /path/to/CorpusTools`
3844

3945
### Requirements
4046

@@ -43,14 +49,12 @@ Recommended if you intend to do development on the scripts.
4349
- pandoc (only needed for convert2xml?)
4450

4551
Installation commands
46-
=== "Mac"
52+
===================== "Mac"
4753

48-
```
49-
sudo port install latex2html poppler pandoc
50-
```
54+
``` sudo port install latex2html poppler pandoc ```
5155

52-
=== "Debian/Ubuntu"
56+
"Debian/Ubuntu"
5357
`sudo apt-get install poppler-utils pandoc`
5458

55-
=== "Arch Linux"
59+
"Arch Linux"
5660
`sudo pacman -S pandoc`

docs/docs/scripts/index.md

Lines changed: 8 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,14 @@
11
# Overview
22

3-
CorpusTools is a collection of scripts, each doing one specialized task.
4-
When the CorpusTools are installed, these can be run directly from the
5-
command-line.
3+
CorpusTools is a collection of scripts, each doing one specialized task. When
4+
the CorpusTools are installed, these can be run directly from the command-line.
65

76
terminology:
8-
- "module" - the folder in the root folder, e.g.
9-
"converted", "korp", "analysed", "tmx",
10-
and also directories that are part of different "areas" (??), like:
11-
"goldstandard/converted", "correct-no-gs/converted"
7+
- "module" - the folder in the root folder, e.g. "converted", "korp",
8+
"analysed", "tmx", and also directories that are part of different
9+
"areas" (??), like: "goldstandard/converted", "correct-no-gs/converted"
1210

1311

14-
- [add\_files\_to\_corpus](add_files_to_corpus)
15-
- [convert2xml](convert2xml)
16-
- [analyse\_corpus](analyse_corpus)
12+
- [add\_files\_to\_corpus](add_files_to_corpus.md)
13+
- [convert2xml](convert2xml.md)
14+
- [analyse\_corpus](analyse_corpus.md)

docs/gen_ref_pages.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@
1414

1515
for path in sorted(Path("..", "corpustools").rglob("*.py")):
1616
module_path = path.relative_to("..").with_suffix("")
17-
doc_path = path.relative_to("..", "corpustools").with_suffix(".md")
17+
doc_path = path.relative_to(Path("..") / "corpustools").with_suffix(".md")
1818
full_doc_path = Path("reference", doc_path)
1919

2020
parts = tuple(module_path.parts)

docs/mkdocs.yml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,6 @@ nav:
3939
- normalise_corpus_names: scripts/normalise_corpus_names.md
4040
- paracheck: scripts/paracheck.md
4141
- parallelize: scripts/parallelize.md
42-
- pick_parallel_docs: scripts/pick_parallel_docs.md
4342
- remove_corpus_file: scripts/remove_corpus_file.md
4443
- reparallelize: scripts/reparallelize.md
4544
- saami_crawler: scripts/saami_crawler.md

0 commit comments

Comments
 (0)