Installation — INDRA 1.22.0 documentation (original) (raw)
Installing Python
INDRA is a Python package so the basic requirement for using it is to have Python installed. Python is shipped with most Linux distributions and with OSX. INDRA works with Python 3.6 or higher.
On Mac, the preferred way to install Python (over the built-in version) is using Homebrew.
On Windows, we recommend using Anacondawhich contains compiled distributions of the scientific packages that INDRA depends on (numpy, scipy, pandas, etc).
Installing INDRA
Installing via Github
The preferred way to install INDRA is to use pip and point it to either a remote or a local copy of the latest source code from the repository. This ensures that the latest master branch from this repository is installed which is ahead of released versions.
To install directly from Github, do:
pip install git+https://github.com/sorgerlab/indra.git
Or first clone the repository to a local folder and use pip to install INDRA from there locally:
git clone https://github.com/sorgerlab/indra.git cd indra pip install .
Cloning the source code from Github
You may want to simply clone the source code without installing INDRA as a system-wide package.
git clone https://github.com/sorgerlab/indra.git
To be able to use INDRA this way, you need to make sure that all its requirements are installed. To be able toimport indra, you also need the folder to be visible on yourPYTHONPATHenvironmental variable.
Installing releases with pip
Releases of INDRA are also available viaPyPI. You can install the latest released version of INDRA as
INDRA dependencies
INDRA depends on a few standard Python packages (e.g. rdflib, requests, objectpath). These packages are installed automatically by pip.
Below we provide a detailed description of some extra dependencies that may require special steps to install.
PySB and BioNetGen
INDRA builds on the PySB framework to assemble rule-based models of biochemical systems. The pysb python package is installed by the standard install procedure. However, to be able to generate mathematical model equations and to export to formats such as SBML, theBioNetGenframework also needs to be installed in a way that is visible to PySB. Detailed instructions are given in thePySB documentation.
Pyjnius
Pyjnius is currently not required for any of INDRA’s features. However, to be able to use INDRA’s optional JAR-based offline reading via the REACH and Eidos APIs,pyjnius is needed to allow using Java/Scala classes from Python.
1. Install JDK from Oracle: https://www.oracle.com/technetwork/java/javase/downloads/index.html. We recommend using Java 8 (INDRA is regularly tested with Java 8), however, Java 11 is also expected to be compatible, with possible extra configuration steps needed that are not described here.
- Set JAVA_HOME to your JDK home directory, for instance
export JAVA_HOME=/Library/Java/JavaVirtualMachines/jdk-11.0.2.jdk/Contents/Home
- Then first install cython followed by pyjnius (tested with version 1.1.4). These need to be broken up into two sequential calls to pip install.
pip install cython pip install pyjnius==1.1.4
On Mac, you may need toinstall Legacy Java for OSX. If you have trouble installing it, you can try the following as an alternative. Edit
/Library/Java/JavaVirtualMachines/jdk-11.0.2.jdk/Contents/Info.plist
(the JDK folder name will need to correspond to your local version), and add JNI to JVMCapabilities as
... JVMCapabilities CommandLine JNI ...
Graphviz
Some INDRA modules contain functions that useGraphviz to visualize graphs. On most systems, doing
works. However on Mac this often fails, and, assuming Homebrew is installed one has to
brew install graphviz pip install pygraphviz --install-option="--include-path=/usr/local/include/graphviz/" --install-option="--library-path=/usr/local/lib/graphviz"
where the –include-path and –library-path needs to be set based on where Homebrew installed graphviz.
Optional additional dependencies
Some dependencies of INDRA are only needed by certain submodules or are only used in specialized use cases. These are not installed by default but are listed as “extra” requirements, and can be installed separately using pip. An extra dependency list (e.g. one called extra_list) can be installed as
pip install indra[extra_list]
You can also install all extra dependencies by doing
pip install indra --install-option="complete"
or
In all of the above, you may replace indra with . (if you’re in a local copy of the indra folder or with the Github URL of the INDRA repo, depending on your installation method. See also the correspondingpip documentationfor more information.
The table below provides the name and the description of each “extra” list of dependencies.
Extra list name | Purpose |
---|---|
bel | BEL input processing and output assembly |
trips_offline | Offline reading with local instance of TRIPS system |
reach_offline | Offline reading with local instance of REACH system |
eidos_offline | Offline reading with local instance of Eidos system |
geneways | Genewayas reader input processing |
sofia | SOFIA reader input processing |
bbn | BBN reader input processing |
sbml | SBML model export through the PySB Assembler |
grounding | Packages for re-grounding and disambiguating entities |
machine | Running a local instance of a “RAS machine” |
explanation | Finding explanatory paths in rule-based models |
aws | Accessing AWS compute and storage resources |
graph | Assembling into a visualizing Graphviz graphs |
plot | Create and display plots |
Configuring INDRA
Various aspects of INDRA, including API keys, dependency locations, and Java memory limits, are parameterized by a configuration file that lives in ~/.config/indra/config.ini. The default configuration file is provided in indra/resources/default_config.ini, and is copied to ~/.config/indra/config.ini when INDRA starts if no configuration already exists. Every value in the configuration can also be set as an environment variable: for a given configuration key, INDRA will first check for an environment variable with that name and if not present, will use the value in the configuration file. In other words, an environment variable, when set, takes precedence over the value set in the config file.
Configuration values include:
- REACHPATH: The location of the JAR file containing a local instance of the REACH reading system
- EIDOSPATH: The location of the JAR file containing a local instance of the Eidos reading system
- SPARSERPATH: The location of a local instance of the Sparser reading system (path to a folder)
- DRUMPATH: The location of a local installation of the DRUM reading system (path to a folder)
- NDEX_USERNAME, NDEX_PASSWORD: Credentials for accessing the NDEx web service
- ELSEVIER_API_KEY, ELSEVIER_INST_KEY: Elsevier web service API keys
- BIOGRID_API_KEY: API key for BioGRID web service (seehttp://wiki.thebiogrid.org/doku.php/biogridrest)
- INDRA_DEFAULT_JAVA_MEM_LIMIT: Maximum memory limit for Java virtual machines launched by INDRA
- SITEMAPPER_CACHE_PATH: Path to an optional cache (a pickle file) for the SiteMapper’s automatically obtained mappings.