Abundance of perfect and imperfect trinucleotide repeats in chimpanzee chromosome 22

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 6111 397171.1611.12 9284 386260.0310.81 0 00.000.00 0 00.000.00 2712 169194.5812.12 4102 163294.3111.70 2922 194139.049.23 4359 189207.438.99
aag 1677 10846.973.02 3474 9197.302.55 36 347.893.99 51 367.853.99 396 2828.412.01 978 2670.171.86 1032 6449.113.04 2024 5196.312.43
aat 9243 572258.8816.02 13375 557374.6115.60 0 00.000.00 0 00.000.00 3528 213253.1315.28 5022 206360.3214.78 4881 307232.2614.61 7025 300334.2914.28
acc 1701 12847.643.58 2558 12771.643.56 87 7115.749.31 126 7167.629.31 795 6357.044.52 993 6371.254.52 783 5537.262.62 1403 5466.762.57
acg 12 10.340.03 12 10.340.03 12 115.961.33 12 115.961.33 0 00.000.00 0 00.000.00 0 00.000.00 0 00.000.00
act 108 83.020.22 206 85.770.22 0 00.000.00 0 00.000.00 48 43.440.29 103 47.390.29 60 42.850.19 103 44.900.19
agc 2565 18471.845.15 3927 179109.995.01 414 30550.7739.91 743 30988.4539.91 1050 7275.345.17 1393 6799.954.81 786 6037.402.85 1028 6048.922.85
agg 3978 300111.428.40 8729 277244.487.76 315 23419.0630.60 604 22803.5329.27 1380 10399.017.39 3023 96216.906.89 1944 14892.517.04 4555 133216.756.33
atc 1707 12147.813.39 5824 101163.122.83 27 235.922.66 30 239.912.66 528 3637.882.58 1651 32118.462.30 1005 7347.823.47 3945 57187.722.71
ccg 939 6526.301.82 2484 5969.571.65 210 16279.3721.29 453 15602.6519.95 171 1312.270.93 552 1139.600.79 480 3022.841.43 1262 2860.051.33
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.7613.37367 / 10 acg, act, ccg
number per megabase1.0000.12567 / 10 acg, act, ccg
Coding regions
length per megabase0.9170.50934 / 10 aac, aag, aat, acg, act, atc
number per megabase0.9980.03734 / 10 aac, aag, aat, acg, act, atc
Introns
length per megabase0.6923.05256 / 10 aag, acg, act, ccg
number per megabase1.0000.03656 / 10 aag, acg, act, ccg
Intergenic regions
length per megabase0.3835.28056 / 10 acg, act, agc, ccg
number per megabase1.0000.29167 / 10 acg, act, ccg