Abundance of perfect and imperfect trinucleotide repeats in human chromosome 21

ALL
                 perfect                        imperfect
Repeat     TL    TN     LPMb   NPMb        TL    TN     LPMb   NPMb
aac      5916   406   174.39  11.97      9143   395   269.51  11.64
aag      1797   124    52.97   3.65      3623   103   106.80   3.04
aat      8892   569   262.11  16.77     13144   554   387.45  16.33
acc      2427   150    71.54   4.42      6088    98   179.46   2.89
acg         0     0     0.00   0.00         0     0     0.00   0.00
act       381    26    11.23   0.77      1363    23    40.18   0.68
agc      1563   121    46.07   3.57      2226   119    65.62   3.51
agg      2367   175    69.77   5.16      5447   151   160.56   4.45
atc      1311    92    38.65   2.71      1957    87    57.69   2.56
ccg       429    28    12.65   0.82      1339    27    39.47   0.80

CODING
                 perfect                        imperfect
Repeat     TL    TN     LPMb   NPMb        TL    TN     LPMb   NPMb
aac         0     0     0.00   0.00         0     0     0.00   0.00
aag        75     6   215.97  17.28       211     6   607.61  17.28
aat         0     0     0.00   0.00         0     0     0.00   0.00
acc        84     5   241.89  14.40       138     5   397.39  14.40
acg         0     0     0.00   0.00         0     0     0.00   0.00
act         0     0     0.00   0.00         0     0     0.00   0.00
agc       204    15   587.45  43.20       531    14  1529.09  40.31
agg        96     7   276.45  20.16       152     6   437.71  17.28
atc        39     3   112.31   8.64        42     3   120.94   8.64
ccg        36     3   103.67   8.64        53     3   152.62   8.64

INTRON
                 perfect                        imperfect
Repeat     TL    TN     LPMb   NPMb        TL    TN     LPMb   NPMb
aac      1539   106   163.67  11.27      2296   104   244.18  11.06
aag       390    28    41.48   2.98       571    24    60.73   2.55
aat      2103   135   223.65  14.36      2995   132   318.51  14.04
acc       495    34    52.64   3.62      2089    23   222.16   2.45
acg         0     0     0.00   0.00         0     0     0.00   0.00
act       117     9    12.44   0.96       954     6   101.46   0.64
agc       387    31    41.16   3.30       475    31    50.52   3.30
agg       648    49    68.91   5.21      1273    41   135.38   4.36
atc       423    28    44.98   2.98       715    26    76.04   2.77
ccg        48     4     5.11   0.42       207     4    22.01   0.42

INTERGENIC
                 perfect                        imperfect
Repeat     TL    TN     LPMb   NPMb        TL    TN     LPMb   NPMb
aac      3963   270   163.94  11.17      6218   262   257.22  10.84
aag      1245    83    51.50   3.43      2721    67   112.56   2.77
aat      6243   402   258.25  16.63      9384   391   388.19  16.17
acc      1743   103    72.10   4.26      3696    62   152.89   2.56
acg         0     0     0.00   0.00         0     0     0.00   0.00
act       237    15     9.80   0.62       373    15    15.43   0.62
agc       825    64    34.13   2.65       990    63    40.95   2.61
agg      1458   108    60.31   4.47      3740    93   154.71   3.85
atc       783    56    32.39   2.32      1104    54    45.67   2.23
ccg       153    10     6.33   0.41       719    10    29.74   0.41

TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase
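
The two per-megabase columns are rescalings of TL and TN by the size of the sequence class in megabases. The source does not state those sizes, so the sketch below infers one from the ratio TL / LPMb (an assumption made only for illustration) and checks that the same size reproduces NPMb from TN. The function name `per_megabase` is hypothetical; the four input values are the "all sequences", perfect, aac entries of the table.

```python
def per_megabase(value, region_mb):
    """Scale a raw total (length in bp, or a count) to a per-megabase figure."""
    return value / region_mb

# aac, perfect repeats, all sequences (values read from the table)
tl, tn, lpmb, npmb = 5916, 406, 174.39, 11.97

# Assumption: the analysed sequence size is not given in the source,
# so it is inferred here from TL / LPMb.
region_mb = tl / lpmb

# The same inferred size should reproduce the tabulated NPMb from TN.
npmb_check = per_megabase(tn, region_mb)

print(f"implied size: {region_mb:.2f} Mb")
print(f"NPMb from TN: {npmb_check:.2f} (table: {npmb})")
```

The same check can be applied to any other cell group of the table, since TL / LPMb and TN / NPMb should agree within one sequence class up to rounding.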

Statistical analysis (contingency test results)

Data points with the smallest value were excluded iteratively until every remaining data point contributed more than 5% of the total. The data sets were then normalized to 100%.
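
The procedure described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes the contingency test compares the perfect and imperfect distributions across repeat classes (a 2 x k table, giving k - 1 degrees of freedom), and that a class is dropped when it falls at or below 5% in either series. The function names `exclude_small`, `normalize`, `chi_square` and `chi2_sf` are hypothetical. The input values are the LPMb figures for all sequences; acg is left out because it is zero everywhere, which leaves 9 data points.

```python
import math

# LPMb, all sequences (from the abundance table); acg omitted (all zero).
PERFECT = {"aac": 174.39, "aag": 52.97, "aat": 262.11, "acc": 71.54, "act": 11.23,
           "agc": 46.07, "agg": 69.77, "atc": 38.65, "ccg": 12.65}
IMPERFECT = {"aac": 269.51, "aag": 106.80, "aat": 387.45, "acc": 179.46, "act": 40.18,
             "agc": 65.62, "agg": 160.56, "atc": 57.69, "ccg": 39.47}

def exclude_small(series_list, threshold_pct=5.0):
    """Iteratively drop the class with the smallest share until every
    remaining class exceeds threshold_pct in every series (assumed rule)."""
    kept = sorted(series_list[0])
    excluded = []
    while len(kept) > 2:
        candidates = []
        for s in series_list:
            total = sum(s[k] for k in kept)
            smallest = min(kept, key=lambda k: s[k])
            share = 100.0 * s[smallest] / total
            if share <= threshold_pct:
                candidates.append((share, smallest))
        if not candidates:
            break
        drop = min(candidates)[1]  # lowest share across the series
        kept.remove(drop)
        excluded.append(drop)
    return kept, sorted(excluded)

def normalize(series, kept):
    """Rescale the kept classes so that they sum to 100%."""
    total = sum(series[k] for k in kept)
    return [100.0 * series[k] / total for k in kept]

def chi_square(rows):
    """Pearson chi-square statistic and degrees of freedom for an r x c table."""
    row_tot = [sum(r) for r in rows]
    col_tot = [sum(c) for c in zip(*rows)]
    grand = sum(row_tot)
    stat = 0.0
    for i, r in enumerate(rows):
        for j, obs in enumerate(r):
            exp = row_tot[i] * col_tot[j] / grand
            stat += (obs - exp) ** 2 / exp
    return stat, (len(rows) - 1) * (len(col_tot) - 1)

def chi2_sf(x, df):
    """Upper-tail chi-square probability for integer df >= 1 (closed form)."""
    half = x / 2.0
    if df % 2 == 0:
        term, total = 1.0, 1.0
        for i in range(1, df // 2):
            term *= half / i
            total += term
        return math.exp(-half) * total
    term, total = 1.0, 0.0
    for i in range(1, (df - 1) // 2 + 1):
        if i > 1:
            term *= x / (2 * i - 1)
        total += term
    return math.erfc(math.sqrt(half)) + math.sqrt(2 * x / math.pi) * math.exp(-half) * total

kept, excluded = exclude_small([PERFECT, IMPERFECT])
stat, df = chi_square([normalize(PERFECT, kept), normalize(IMPERFECT, kept)])
prob = chi2_sf(stat, df)
print(excluded, round(stat, 3), df, round(prob, 3))
```

Under these assumptions the sketch excludes act, atc and ccg and yields a chi-square near 2.17 with 5 degrees of freedom, which is in line with the "length per megabase" row for all sequences in the results table. Agreement on one row does not confirm that the original analysis used exactly this rule.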

Feature                Probability  Chi-square  Degrees of freedom  Used / all data points  Excluded repeats
All sequences
  length per megabase  0.825        2.170       5                   6 / 9                   act, atc, ccg
  number per megabase  0.996        0.625       6                   7 / 9                   act, ccg
Coding regions
  length per megabase  0.422        2.811       3                   4 / 9                   aac, aat, act, atc, ccg
  number per megabase  1.000        0.155       5                   6 / 9                   aac, aat, act
Introns
  length per megabase  0.274        6.351       5                   6 / 9                   act, agc, ccg
  number per megabase  0.997        0.542       6                   7 / 9                   act, ccg
Intergenic regions
  length per megabase  0.744        1.954       4                   5 / 9                   act, agc, atc, ccg
  number per megabase  0.990        0.883       6                   7 / 9                   act, ccg