Microbial 'omics

Brought to you by

# functional-enrichment-txt [artifact]

A TXT-type anvi’o artifact. This artifact is typically generated, used, and/or exported by anvi’o (and not provided by the user)..

Back to the main page of anvi’o programs and artifacts.

## Provided by

anvi-compute-functional-enrichment

## Required or used by

There are no anvi’o tools that use or require this artifact directly, which means it is most likely an end-product for the user.

## Description

This is a TAB-delimited output file that describes enrichment scores and associated groups for functions or metabolic modules in groups of genomes or samples. It is produced by anvi-compute-functional-enrichment.

## General format

Each row in the matrix describes an entity (a function, functional association of a gene cluster, or metabolic module) that is associated with one or more groups of samples or genomes. These are listed with the highest enrichment scores displayed first.

The following columns of information are listed in the file:

• the name of the enriched entity, which can be a functional association, metabolic module, or function. The header of this column is either your functional annotation source, OR ‘KEGG_MODULE’ if you are working with metabolic modules
• enrichment_score: a measure of much this particular entity is enriched in the group it is associated with (i.e., measures how unique this entity [see column 1] is to this group(s) [see column 5])
• unadjusted_p_value: the significance value of the hypothesis test for enrichment, unadjusted for multiple hypothesis testing
• adjusted_q_value: the adjusted p-value after taking into account multiple hypothesis testing
• associated groups: the list of groups that this entity is associated with
• accession: a function accession number or KEGG module number (depends on your input option for anvi-compute-functional-enrichment)
• a list of gene cluster ids, sample names, or genome names that this entity is found in (also depends on your input option for anvi-compute-functional-enrichment)
• p values for each group: gives the proportion of the group’s member genomes or samples in which this entity was found.
• N values for each group: gives the total number of genomes or samples in each group.

## A specific example - enriched functions in pangenomes

When you run anvi-compute-functional-enrichment (with input option 1) to compute enrichment scores for functions in a pangenome, the resulting matrix describes the gene cluster-level functional associations that are enriched within specific groups of your pangenome. This is described in more detail in the pangenomics tutorial.

Here is a more concrete example (the same example as in the pangenomics tutorial). Note that that tutorial uses COG_FUNCTION as the functional annotation source, and has LL (low light) and HL (high light) as the two pan-groups.