Table 4 In-depth analysis of the outlier pentapeptides of the type a2bcd.

From: Global pentapeptide statistics are far away from expected distributions

Focus residue

Protein region

% “aa”

%“axa”

%“axxa”

%“axxxa”

Total counts

A

DM

23

26

22

27

443

A

ND

20

22

18

37

30

A

NN

100

0

0

0

6

C

DM

2

5

82

9

530

C

ND

3

17

44

34

60

C

NN

2

0

73

24

22

D

DM

20

36

22

20

366

D

ND

38

20

20

20

27

D

NN

0

33

66

0

7

E

DM

37

15

31

15

456

E

ND

45

0

42

12

23

E

NN

85

0

14

0

13

F

DM

27

25

20

26

340

F

ND

30

27

23

18

29

F

NN

100

0

0

0

2

G

DM

21

28

28

21

673

G

ND

22

18

50

8

63

G

NN

33

66

0

0

5

H

DM

18

20

15

46

361

H

ND

10

17

10

62

17

H

NN

0

0

0

100

3

I

DM

23

20

28

27

395

I

ND

21

8

27

42

34

I

NN

0

0

0

100

1

K

DM

31

20

25

22

330

K

ND

41

21

20

16

35

K

NN

56

24

0

18

17

L

DM

18

12

30

37

511

L

ND

36

9

41

11

46

L

NN

50

0

16

33

8

M

DM

18

17

30

33

231

M

ND

7

13

39

39

24

M

NN

0

0

33

66

2

N

DM

24

24

21

29

296

N

ND

20

53

26

0

11

N

NN

0

100

0

0

3

P

DM

15

21

31

32

464

P

ND

15

20

38

25

44

P

NN

20

0

0

80

2

Q

DM

34

28

19

18

227

Q

ND

10

28

17

42

29

Q

NN

    

0

R

DM

30

29

28

11

383

R

ND

31

47

7

14

21

R

NN

0

33

33

33

6

S

DM

30

21

27

20

278

S

ND

29

12

38

19

13

S

NN

63

0

36

0

9

T

DM

21

29

26

23

376

T

ND

11

49

15

23

32

T

NN

0

57

0

42

5

V

DM

31

30

18

19

375

V

ND

7

28

21

42

19

V

NN

57

21

0

21

15

W

DM

23

38

19

18

305

W

ND

28

28

14

28

27

W

NN

    

0

Y

DM

28

20

25

25

282

Y

ND

42

22

0

34

8

Y

NN

0

0

0

100

2

  1. For every “focus residue” a, the highest abundant outliers (z > 100) were considered that contained exactly two occurrences of the focus residue. Then, occurrences were considered where the focus residue was separated by 0, 1, 2 or 3 residues.