Skip to content

Installation


This section will help you set up the necessary software and packages required to run SigProfilerClusters.


Prerequisites

  • Python version >= 3.9
  • SigProfilerMatrixGenerator with a downloaded reference genome (required to generate mutational matrices and simulations)
  • SigProfilerSimulator (required to generate the background model used for IMD threshold calculation)
  • Other dependencies are installed automatically during package installation

Installation

SigProfilerClusters can be executed on any Windows/macOS/Unix system. First follow the SigProfilerMatrixGenerator guide for installing Python and pip. Next, follow the instructions below for the latest stable release or the current GitHub version.

Installation with pip

Install the latest SigProfilerClusters PyPI version using pip:

$ pip install SigProfilerClusters

To upgrade an existing installation to the most recent version:

$ pip install SigProfilerClusters --upgrade

Install specific GitHub Release

First, download the zip file or clone the GitHub repository:

$ git clone https://github.com/SigProfilerSuite/SigProfilerClusters.git

Next, enter the downloaded directory and install the package:

$ cd SigProfilerClusters
$ pip install .

Download Reference Genome

SigProfilerClusters requires a reference genome to generate simulations and mutational matrices. Current reference genomes supported include GRCh37, GRCh38, mm9, and mm10. To install the reference genome/s, you need to use SigProfilerMatrixGenerator.

The last PyPI SigProfilerMatrixGenerator version is installed with SigProfilerClusters by default. You can also install a specific version following the instructions in the SigProfilerMatrixGenerator Wiki.

Once SigProfilerMatrixGenerator is installed, install your desired reference genome from the command line/terminal as follows.

Installation from command line

$ SigProfilerMatrixGenerator install GRCh37

Installation from Python terminal

$ python
>>> from SigProfilerMatrixGenerator import install as genInstall
>>> genInstall.install('GRCh37', rsync=False, bash=True)

If you have a firewall on your server, you may need to install rsync and use the rsync=True parameter. Similarly, if you do not have bash available, use bash=False.

In case you prefer to install a reference genome from a local copy:

$ python
>>> from SigProfilerMatrixGenerator import install as genInstall
>>> genInstall.install('GRCh37', offline_files_path='path/to/directory/containing/GRCh37.tar.gz')