Skip to main content

Table 1 Size of A. pompejana protein datasets

From: Deep transcriptome-sequencing and proteome analysis of the hydrothermal vent annelid Alvinella pompejana identifies the CvP-bias as a robust measure of eukaryotic thermostability

Dataset

% identity

#full-length

#partial with stop

#total

MPI (New data)

100

6 272

15 886

28 169

 

98

5 778

14 893

26 992

 

90

5 667

14 502

26 433

JGI + Genoscope (Existing data)

100

6 233

15 539

23 962

 

98

5 360

13 365

19 890

 

90

5 008

12 341

18 155

MPI + JGI + Genoscope (Combined data)

100

10 778

26 068

42 665

 

98

9 359

23 131

38 185

 

90

8 722

21 288

35 235

  1. Number of full-length (with start and stop codon), partial (with stop codon), and total number of predicted protein sequences in the three datasets clustered at 100%, 98% and 90% identity.