Abundance of perfect and imperfect trinucleotide repeats in chimpanzee chromosome 19

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 11976 775190.6312.34 18093 765288.0012.18 12 16.030.50 15 17.540.50 4788 311192.6212.51 7012 306282.0912.31 6024 388167.4410.79 9402 385261.3310.70
aag 3879 25461.744.04 9861 220156.963.50 240 19120.679.55 450 19226.269.55 1194 8048.033.22 3087 72124.192.90 2109 13258.623.67 5382 111149.593.08
aat 21378 1279340.2920.36 30737 1231489.2619.59 12 16.030.50 12 16.030.50 8487 509341.4320.48 12175 493489.8019.83 11226 672312.0318.68 16317 645453.5317.93
acc 2817 20944.843.33 4376 20469.663.25 201 15101.067.54 259 15130.237.54 1092 8043.933.22 1820 7873.223.14 1296 9736.022.70 2003 9555.672.64
acg 27 20.430.03 39 20.620.03 15 17.540.50 21 110.560.50 0 00.000.00 0 00.000.00 0 00.000.00 0 00.000.00
act 192 143.060.22 280 144.460.22 15 17.540.50 15 17.540.50 96 73.860.28 165 76.640.28 69 51.920.14 88 52.450.14
agc 2889 21345.993.39 3953 21162.923.36 759 57381.6328.66 1099 55552.5827.65 867 6534.882.62 1233 6549.602.62 969 7126.931.97 1208 7133.581.97
agg 7167 531114.088.45 17545 475279.277.56 768 56386.1528.16 1712 53860.8026.65 2598 192104.527.72 5803 170233.456.84 3264 24190.726.70 8498 215236.205.98
atc 2247 15435.772.45 5301 13484.382.13 60 530.172.51 87 543.742.51 882 5835.482.33 1950 5478.452.17 1143 8031.772.22 3043 6484.581.78
ccg 2421 16938.542.69 7799 161124.142.56 549 43276.0421.62 1787 42898.5121.12 570 3922.931.57 1740 3770.001.49 1101 7530.602.08 3721 70103.431.95
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.5952.77945 / 10 acg, act, agc, atc, ccg
number per megabase1.0000.08956 / 10 acg, act, atc, ccg
Coding regions
length per megabase0.1994.65034 / 10 aac, aat, acc, acg, act, atc
number per megabase1.0000.01845 / 10 aac, aat, acg, act, atc
Introns
length per megabase0.7012.18945 / 10 acg, act, agc, atc, ccg
number per megabase1.0000.07656 / 10 acg, act, atc, ccg
Intergenic regions
length per megabase0.3743.11834 / 10 acc, acg, act, agc, atc, ccg
number per megabase0.9980.12045 / 10 acg, act, agc, atc, ccg