Gargantext is a collaborative web platform for the exploration of sets
of unstructured documents. It combines tools from natural language
processing, text mining, complex network analysis and interactive data
visualization to pave the way toward new kinds of interactions with your
digital corpora. It is free software, developed by the CNRS
Complex Systems Institute of Paris Île-de-France (ISC-PIF) and its

Disclaimer: this project is still in development and is a work in
progress. If you encounter issues, please report them and help improve this documentation.

NOTE: The default build (with optimizations) requires a large amount of RAM (at least 16 GB). To avoid long compilation times and heavy swapping, it is recommended to run `stack build` with the `--fast` flag, i.e.:

    stack --docker build --fast

or

    stack --nix build --fast

This might be related to the [broken Swagger `-O2` issue](https://github.com/haskell-servant/servant/issues/986).
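
The choice between an optimized and a `--fast` build can be scripted. The following is a minimal sketch, assuming a Linux host where `/proc/meminfo` is available. The helper names `choose_build_flags` and `total_ram_gb` are illustrative and not part of the Gargantext tooling.

```shell
#!/bin/sh
# Hypothetical helper: print the extra stack flag to use for a given
# amount of RAM in whole GB. 16 GB is the minimum quoted above for the
# default (optimized) build.
choose_build_flags() {
  ram_gb=$1
  if [ "$ram_gb" -ge 16 ]; then
    echo ""          # enough memory: keep optimizations
  else
    echo "--fast"    # otherwise skip optimizations
  fi
}

# Total RAM in whole GB, read from /proc/meminfo (Linux only; assumption).
total_ram_gb() {
  awk '/^MemTotal:/ { printf "%d\n", $2 / 1024 / 1024 }' /proc/meminfo
}

# Usage (not executed here):
#   stack --docker build $(choose_build_flags "$(total_ram_gb)")
```

Because the kernel reports slightly less than the installed memory, a machine with exactly 16 GB will typically round down and get `--fast`, which errs on the safe side.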

    curl -sSL https://gitlab.iscpif.fr/gargantext/haskell-gargantext/raw/dev/devops/docker/docker-install | sh

    curl -sSL https://gitlab.iscpif.fr/gargantext/haskell-gargantext/raw/dev/devops/debian/install | sh

    curl -sSL https://gitlab.iscpif.fr/gargantext/haskell-gargantext/raw/dev/devops/ubuntu/install | sh
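
Piping a remote script straight into `sh` runs it unseen. A more cautious variant, sketched below, downloads the installer to a file so it can be read first. The helper name `install_url` and the output file name are hypothetical; the URLs are the ones listed above.

```shell
#!/bin/sh
# Base URL of the install scripts, taken from the commands above.
BASE="https://gitlab.iscpif.fr/gargantext/haskell-gargantext/raw/dev/devops"

# Hypothetical helper: map a target name to its install script URL.
install_url() {
  case "$1" in
    docker)        echo "$BASE/docker/docker-install" ;;
    debian|ubuntu) echo "$BASE/$1/install" ;;
    *)             echo "unknown target: $1" >&2; return 1 ;;
  esac
}

# Usage (not executed here): download, review, then run.
#   curl -sSL "$(install_url debian)" -o gargantext-install.sh
#   less gargantext-install.sh
#   sh gargantext-install.sh
```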

1. CoreNLP is needed (EN and FR); this dependency will soon no longer be needed.

       ./devops/install-corenlp

2. Louvain C++ is needed to draw the socio-semantic graphs.

   NOTE: This is already included in the Docker build.

       git clone https://gitlab.iscpif.fr/gargantext/clustering-louvain-cplusplus.git
       cd clustering-louvain-cplusplus

The initialization schema should be loaded automatically (from `devops/postgres/schema.sql`).

##### Fix the passwords

Change the passwords in `gargantext.ini_toModify`, then rename it:

    mv gargantext.ini_toModify gargantext.ini

(`.gitignore` prevents this file from being added to the repository by mistake.)
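
The rename step can be wrapped in a small guard so that it fails loudly when the template is missing. This is only a sketch: the helper name `activate_config` is hypothetical, and restricting the file to its owner with `chmod 600` is an added precaution, not something the project requires.

```shell
#!/bin/sh
# Hypothetical helper: rename the edited template found in directory $1
# (default: current directory) and make the result owner-only.
activate_config() {
  dir=${1:-.}
  if [ ! -f "$dir/gargantext.ini_toModify" ]; then
    echo "template not found in $dir" >&2
    return 1
  fi
  mv "$dir/gargantext.ini_toModify" "$dir/gargantext.ini"
  chmod 600 "$dir/gargantext.ini"   # assumption: the file holds passwords
}

# Usage, after editing the passwords (not executed here):
#   activate_config .
#   git check-ignore -q gargantext.ini && echo "ignored by git"
```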

Users have to be created first (`user1` is created as an example):

    ~/.local/bin/gargantext-init "gargantext.ini"

For a Docker environment, first create the appropriate image:

    docker build -t cgenie/stack-build:lts-17.13-garg .

    stack --docker run gargantext-init -- gargantext.ini

You can import some data with:

    docker run --rm -it -p 9000:9000 cgenie/corenlp-garg
    stack exec gargantext-import -- "corpusCsvHal" "user1" "IMT3" gargantext.ini 10000 ./1000.csv
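
To import several files without retyping the positional arguments, the command can be generated by a helper. The sketch below only prints the command. The helper name `import_cmd` is hypothetical, and the roles given to the arguments (user, corpus name, document limit, CSV path) are inferred from the example above rather than taken from the tool's documentation.

```shell
#!/bin/sh
# Hypothetical helper: print the import command for a given user,
# corpus name and CSV file. Argument roles are assumptions.
import_cmd() {
  user=$1; corpus=$2; csv=$3
  limit=${4:-10000}   # assumed to cap the number of imported documents
  printf 'stack exec gargantext-import -- "corpusCsvHal" "%s" "%s" gargantext.ini %s %s\n' \
    "$user" "$corpus" "$limit" "$csv"
}

# Usage: print the command, review it, then run it by copy-paste.
#   import_cmd user1 IMT3 ./1000.csv
```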

It is also possible to build everything with [Nix](https://nixos.org/) instead of Docker:

    stack --nix exec gargantext-import -- "corpusCsvHal" "user1" "IMT3" gargantext.ini 10000 ./1000.csv
    stack --nix exec gargantext-server -- --ini gargantext.ini --run Prod

### Multi-User with Graphical User Interface (Server Mode)

    ~/.local/bin/stack --docker exec gargantext-server -- --ini "gargantext.ini" --run Prod

Then you can log in with `user1` / `1resu`.

### Command Line Mode tools

#### Simple co-occurrence computation and indexing from a list of ngrams

    stack --docker exec gargantext-cli -- CorpusFromGarg.csv ListFromGarg.csv Output.json

### Analyzing the ngrams table repo

The repository is stored in the `repos` directory in the [CBOR](https://cbor.io/)
file format. To decode it to JSON and analyze it with, say,
[jq](https://shapeshed.com/jq-json/), use the following command:

    cat repos/repo.cbor.v5 | stack --nix exec gargantext-cbor2json | jq .

To build the documentation, run:

    stack --docker build --haddock --no-haddock-deps --fast