CellRegulomiX Interface Help

Run, Score and Network Files

Name of the Run

Purpose
The Run Name is a short textual identifier for your analysis. It is used to:

  • name the output directory;
  • group results under a unique identifier;
  • avoid overwriting previous runs.

Requirements

  • Should be unique if you want to keep multiple runs.
  • Allowed characters: letters (A-Z, a-z), digits (0-9), dashes (-) and underscores (_).
  • Avoid spaces and special characters to prevent filesystem issues.

Example

MyRun_01
TF_analysis_January
AML_case23
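
To check a name before submitting, the character rules above can be expressed as a regular expression. This is a minimal sketch; the function name is_valid_run_name is hypothetical and not part of the application.

```python
import re

# Allowed characters from the Requirements list:
# letters, digits, dashes and underscores (at least one character).
RUN_NAME_PATTERN = re.compile(r"[A-Za-z0-9_-]+")


def is_valid_run_name(name: str) -> bool:
    """Return True if the whole name consists only of allowed characters."""
    return RUN_NAME_PATTERN.fullmatch(name) is not None


if __name__ == "__main__":
    for candidate in ["MyRun_01", "TF_analysis_January", "my run", "run#2"]:
        print(candidate, is_valid_run_name(candidate))
```

Names containing spaces or special characters, such as "my run" or "run#2", are rejected by this check.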

Number of Network Files

Description
Sets how many network files will be uploaded. The interface creates one upload widget per file.

Accepted values

  • Minimum: 1
  • Maximum: 5

Network File Format

Description
A network file defines regulatory interactions between transcription factors (TFs) and target genes. Each row represents a directed interaction: TF → Target.

Accepted formats

  • File extensions: .csv, .tsv, .txt
  • Accepted delimiters: comma (,), tab (\t), semicolon (;)

Required columns (recommended)

Column Name         Description
TF                  Name of the transcription factor (source)
Target              Name of the regulated gene (destination)
Weight (optional)   Interaction score or confidence

Example (TSV)

TF      Target  Weight
TP53    CDKN1A  0.92
STAT3   SOCS3   0.81
MYC     CCND2   0.76

Notes

  • Use consistent gene naming across files (HGNC symbols, Ensembl IDs, etc.).
  • If header names differ, adapt them before upload or use the app’s mapping options (if available).
  • Duplicate edges are allowed but may be collapsed during processing.
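
To preview a network file before upload, the format described above can be read with Python's standard csv module. This is only a sketch: the function name read_network, the delimiter guess based on the header line, and the rule of keeping the highest weight for duplicate edges are assumptions, not the application's documented behavior.

```python
import csv
import io
from typing import Dict, Optional, Tuple

Edge = Tuple[str, str]


def read_network(text: str) -> Dict[Edge, Optional[float]]:
    """Parse network file text into {(TF, Target): weight or None}."""
    # Guess the delimiter: whichever accepted one is most frequent in the header.
    header_line = text.splitlines()[0]
    delimiter = max([",", "\t", ";"], key=header_line.count)

    edges: Dict[Edge, Optional[float]] = {}
    for row in csv.DictReader(io.StringIO(text), delimiter=delimiter):
        key = (row["TF"].strip(), row["Target"].strip())
        raw = (row.get("Weight") or "").strip()  # Weight column is optional
        weight = float(raw) if raw else None
        # Assumed duplicate policy: keep the highest weight seen for an edge.
        previous = edges.get(key)
        if key not in edges or (
            weight is not None and (previous is None or weight > previous)
        ):
            edges[key] = weight
    return edges


if __name__ == "__main__":
    sample = "TF\tTarget\tWeight\nTP53\tCDKN1A\t0.92\nTP53\tCDKN1A\t0.50\n"
    print(read_network(sample))
```

If the headers in your file are not TF, Target and Weight, rename them first; the sketch looks columns up by these exact names.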

TF Selection

Preselect TFs using positive/negative data

Description
When enabled, TF selection uses two datasets (positive and negative) to rank TFs via differential evidence.

When to use
Enable this option when your experiment includes positive and negative (or control) samples.


Node Score File

Format
Two-column table with gene identifier and score value.

Column   Description
Gene     Gene name
Score    Numeric score
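
Because gene names must be consistent across files, it can help to check which scored genes never appear in the network before uploading. The helper below is a hypothetical sketch (unmatched_genes is not an application function) that works on gene names and (TF, Target) pairs already loaded into memory.

```python
from typing import Iterable, List, Tuple


def unmatched_genes(
    score_genes: Iterable[str], network_edges: Iterable[Tuple[str, str]]
) -> List[str]:
    """Return scored genes that are neither a TF nor a target in the network."""
    network_nodes = {node for edge in network_edges for node in edge}
    return sorted(set(score_genes) - network_nodes)


if __name__ == "__main__":
    edges = [("TP53", "CDKN1A"), ("MYC", "CCND2")]
    # "ENSG00000136997" stands in for an identifier in a different naming scheme.
    print(unmatched_genes(["TP53", "CCND2", "ENSG00000136997"], edges))
```

A non-empty result usually means the two files use different identifier schemes (for example HGNC symbols in one and Ensembl IDs in the other).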

TF List File

Description
Optional plain-text list of TFs to force into the analysis.

Format
One TF name per line, or a single-column CSV with header.
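
Both layouts can be read with the same few lines. This sketch assumes that a header, if present, is literally "TF" (case-insensitive); the application may recognize headers differently, and read_tf_list is a hypothetical name.

```python
from typing import List, Tuple


def read_tf_list(text: str, header_names: Tuple[str, ...] = ("tf",)) -> List[str]:
    """Return TF names from a one-per-line list or a single-column CSV."""
    names = [line.strip() for line in text.splitlines() if line.strip()]
    # Assumption: a first line equal to "TF" is a header, not a gene name.
    if names and names[0].lower() in header_names:
        names = names[1:]
    # Drop repeated names while keeping the original order.
    return list(dict.fromkeys(names))


if __name__ == "__main__":
    print(read_tf_list("TF\nTP53\nMYC\n\nTP53\n"))
```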


Target Gene Color Attributes

Continuous Value File

Description
Used to color nodes using continuous values (e.g., expression, fold change).

Example

Gene Value
TP53 2.1
MYC -0.8
JUN 1.7
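
As an illustration of how continuous values can drive node colors, the sketch below maps a value onto a blue-white-red scale centered on zero. The palette, the symmetric scaling, and the name value_to_hex are assumptions made for this example; the application's actual color scale may differ.

```python
def value_to_hex(value: float, max_abs: float) -> str:
    """Map a value in [-max_abs, max_abs] to a blue-white-red hex color.

    max_abs must be positive; values outside the range are clamped.
    """
    t = max(-1.0, min(1.0, value / max_abs))
    fade = round(255 * (1 - abs(t)))  # 255 at zero (white), 0 at the extremes
    if t >= 0:
        r, g, b = 255, fade, fade  # positive values shade toward red
    else:
        r, g, b = fade, fade, 255  # negative values shade toward blue
    return f"#{r:02x}{g:02x}{b:02x}"


if __name__ == "__main__":
    values = {"TP53": 2.1, "MYC": -0.8, "JUN": 1.7}
    limit = max(abs(v) for v in values.values())
    for gene, v in values.items():
        print(gene, value_to_hex(v, limit))
```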

Category File

Description
Used to color nodes by category/group (discrete values).

Example

Gene Group
TP53 Group1
MYC Group2
JUN Group1
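
Categorical coloring amounts to giving every group one fixed color. The sketch below assigns colors in order of first appearance from a small hard-coded palette; both the palette and the name assign_group_colors are illustrative assumptions, not the application's implementation.

```python
from typing import Dict

# Illustrative palette; colors repeat if there are more groups than entries.
PALETTE = ["#1f77b4", "#ff7f0e", "#2ca02c", "#d62728"]


def assign_group_colors(gene_groups: Dict[str, str]) -> Dict[str, str]:
    """Return {gene: hex color}, with one color per distinct group."""
    group_colors: Dict[str, str] = {}
    for group in gene_groups.values():
        if group not in group_colors:
            group_colors[group] = PALETTE[len(group_colors) % len(PALETTE)]
    return {gene: group_colors[group] for gene, group in gene_groups.items()}


if __name__ == "__main__":
    print(assign_group_colors({"TP53": "Group1", "MYC": "Group2", "JUN": "Group1"}))
```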

Advanced Parameters

This section collects advanced parameters. Use these only if you understand their algorithmic impact.

Improvement Rate

Controls how aggressively the algorithm expands candidate subnetworks (numeric, default: 2.0).

Max Subnetwork Depth

Maximum expansion depth (integer).

Min Subnetwork Size

Minimum nodes required for a valid subnetwork (integer).

Number of Random Subnetworks

Number of random subnetworks used for significance assessment.
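
One common way to use random subnetworks for significance is an empirical p-value: the fraction of random scores that reach the observed score. The sketch below shows that calculation with the usual +1 correction; whether the application computes significance this way is an assumption, and empirical_p_value is a hypothetical name.

```python
from typing import Sequence


def empirical_p_value(observed: float, random_scores: Sequence[float]) -> float:
    """Fraction of random scores >= observed, with a +1 correction.

    The correction keeps the estimate above zero, so the smallest
    reportable p-value is 1 / (number of random subnetworks + 1).
    """
    at_least_as_high = sum(1 for score in random_scores if score >= observed)
    return (at_least_as_high + 1) / (len(random_scores) + 1)


if __name__ == "__main__":
    print(empirical_p_value(5.0, [1.0, 2.0, 3.0, 6.0]))
```

Under this assumption, a larger number of random subnetworks gives finer p-value resolution at the cost of longer run time.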

Minimum Number of TFs

Minimum TFs required in a subnetwork.

Overlap Threshold

Controls redundancy between returned subnetworks (0-1).
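
To give an intuition for an overlap threshold, the sketch below measures overlap as the Jaccard index of two node sets and keeps a subnetwork only if its overlap with every already kept subnetwork does not exceed the threshold. The choice of Jaccard, the greedy order, and the function names are assumptions for illustration; the application's overlap measure is not specified here.

```python
from typing import List, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Shared nodes divided by all nodes of the two sets (0.0 if both empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def filter_redundant(subnetworks: List[Set[str]], threshold: float) -> List[Set[str]]:
    """Greedily keep subnetworks whose overlap with kept ones is <= threshold.

    The input is assumed to be ordered from best to worst.
    """
    kept: List[Set[str]] = []
    for candidate in subnetworks:
        if all(jaccard(candidate, other) <= threshold for other in kept):
            kept.append(candidate)
    return kept


if __name__ == "__main__":
    subs = [{"TP53", "CDKN1A", "MYC"}, {"TP53", "CDKN1A", "JUN"}, {"STAT3", "SOCS3"}]
    print(filter_redundant(subs, 0.4))
```

In this sketch a lower threshold returns fewer, more distinct subnetworks, while a threshold of 1 keeps everything.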

Seed Type

Seed selection strategy: TFs only, Targets only, or Both.


All materials on this site are licensed under:
Creative Commons Attribution – NonCommercial – NoDerivatives 4.0 International (CC BY-NC-ND 4.0)