Abundance of perfect and imperfect trinucleotide repeats in human chromosome 8

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 26409 1717184.5312.00 38700 1695270.4111.84 15 112.940.86 15 112.940.86 7551 482182.7911.67 11022 475266.8111.50 17139 1122170.2911.15 25180 1108250.1911.01
aag 8997 55762.873.89 18001 477125.783.33 147 11126.799.49 530 10457.128.62 2448 15359.263.70 4918 131119.053.17 5535 35054.993.48 10921 306108.513.04
aat 39654 2574277.0817.99 58370 2515407.8617.57 0 00.000.00 0 00.000.00 11466 749277.5618.13 16856 737408.0417.84 25863 1678256.9716.67 38113 1634378.6916.23
acc 6633 48046.353.35 16058 397112.202.77 129 9111.267.76 252 9217.357.76 2565 18462.094.45 8271 128200.223.10 3534 25635.112.54 6856 22968.122.27
acg 21 10.150.01 36 10.250.01 0 00.000.00 0 00.000.00 0 00.000.00 0 00.000.00 21 10.210.01 36 10.360.01
act 1545 11010.800.77 2735 10219.110.71 0 00.000.00 0 00.000.00 477 3411.550.82 801 3019.390.73 933 659.270.65 1416 6314.070.63
agc 5418 40437.862.82 7521 40052.552.79 399 30344.1425.88 897 30773.6625.88 1665 12240.302.95 2226 12153.892.93 2982 22529.632.24 3799 22437.752.23
agg 10746 81075.095.66 25702 634179.594.43 357 25307.9121.56 831 22716.7418.98 3762 28291.076.83 8750 202211.814.89 5979 45559.414.52 14492 363143.993.61
atc 8145 52056.913.63 15336 465107.163.25 177 13152.6611.21 234 13201.8211.21 2553 16461.803.97 4319 154104.553.73 4833 30648.023.04 9911 26698.472.64
ccg 2409 15916.831.11 6501 14345.421.00 429 30370.0125.88 1167 291006.5325.01 225 155.450.36 528 1412.780.34 1107 6811.000.68 3473 5634.510.56
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.8442.03756 / 10 acg, act, agc, ccg
number per megabase0.9990.33367 / 10 acg, act, ccg
Coding regions
length per megabase0.7392.74656 / 10 aac, aat, acg, act
number per megabase1.0000.12856 / 10 aac, aat, acg, act
Introns
length per megabase0.6053.62556 / 10 acg, act, agc, ccg
number per megabase0.9910.82967 / 10 acg, act, ccg
Intergenic regions
length per megabase0.8921.67856 / 10 acg, act, agc, ccg
number per megabase1.0000.24167 / 10 acg, act, ccg