FishTaco file formats

Taxa abundance file

A tab-delimited text file (with headers), where each row is a taxon and each column is a sample, and each cell represents the relative abundance of the taxon in the sample:


Taxa

Sample1

Sample2

Sample3

Taxon1

0.2

0.5

0.1

Taxon2

0.3

0.1

0.7

Taxon3

0.3

0.2

0.1

Taxon4

0.2

0.2

0.1




Function abundance file

A tab-delimited text file (with headers), where each row is a function (e.g., KO) and each column is a sample, and each cell represents the relative abundance of the function in the sample:


Function

Sample1

Sample2

Sample3

K00001

0.1

0.1

0.1

K00002

0.3

0.1

0.1

K00003

0.1

0.1

0.2

K00004

0.2

0.1

0.2

K00005

0.2

0.1

0.2

K00006

0.1

0.5

0.2




Sample sets labels file

A tab-delimited text file (with headers), with two columns, where each row is a sample the value is the sample label:


Sample

Label

Sample1

1

Sample2

0

Sample3

1




Genomic content file

A tab-delimited text file (with headers), where each row is a taxon and each column is a function, and each cell represents the copy number of the function in the genome of the taxon:


Taxa

K00001

K00002

K00003

K00004

K00005

K00006

Taxon1

1

2

0

1

2

0

Taxon2

0

3

0

1

0

0

Taxon3

2

1

2

2

3

1

Taxon4

0

0

2

0

3

2




FishTaco output file (*_main_output_*)

A tab-delimited text file (with headers), where each row is a taxon and each column is a function, and each cell represents the mode and value of contribution to the observed functional shift:


Taxa

K00001

K00002

K00003

K00004

K00005

K00006

Taxon1

a:0.038

a:0.4

c:0.01

c:0.34

b:-0.004

d:-0.01

Taxon2

c:0.293

b:-0.03

d:-0.002

c:0.003

a:0.0631

a:0.2139

Taxon3

b:-0.008

b:-0.015

a:0.03

c:0.0008

d:-0.009

d:-0.011

Taxon4

a:0.048

d:-0.061

b:-0.004

a:0.019

c:4.996

c:0.045


a: case-associated taxon driving enrichment of function in cases; b: case-associated taxon attenuating enrichment of function in cases; c: control-associated taxon driving enrichment of function in cases; d: control-associated taxon attenuating enrichment of function in cases;




FishTacoPlot taxonomy file (for visualization)

In order to plot FishTaco’s decomposition of functional shifts into taxon-level contributions, a taxonomy file linking the various taxa to their phylogenetic assignment is required. The taxonomy file is a tab-delimited text file with no headers, where each row represents one taxon. The format is the taxon name (matching the names in the taxonomic abundance file), followed by its assignment to the following phylogenetic levels: ‘kingdom’,’phylum’,’class’,’order’,’family’,’genus’,’species’, all prefixed by the first letter of the level followed by 2 underscores. Here is an example of the first 5 rows of the default taxonomy file used by the FishTacoPlot package:


Abiotrophia defectiva

k__Bacteria

p__Firmicutes

c__Bacilli

o__Lactobacillales

f__Aerococcaceae

g__Abiotrophia

s__defectiva

Acaryochloris marina

k__Bacteria

p__Cyanobacteria

c__Cyanophyceae

o__Synechococcales

f__Acaryochloridaceae

g__Acaryochloris

s__marina

Acetivibrio cellulolyticus

k__Bacteria

p__Firmicutes

c__Clostridia

o__Clostridiales

f__Clostridiaceae

g__Acetivibrio

s__cellulolyticus

Acetobacter pasteurianus

k__Bacteria

p__Proteobacteria

c__Alphaproteobacteria

o__Rhodospirillales

f__Acetobacteraceae

g__Acetobacter

s__pasteurianus

Acetohalobium arabaticum

k__Bacteria

p__Firmicutes

c__Clostridia

o__Halanaerobiales

f__Halobacteroidaceae

g__Acetohalobium

s__arabaticum