Abundance of perfect and imperfect trinucleotide repeats in human chromosome 13

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 17169 1131179.3111.81 25426 1118265.5511.68 75 5114.597.64 138 5210.857.64 3879 253169.0811.03 5497 250239.6010.90 12147 803168.3511.13 18115 794251.0611.00
aag 4608 30048.133.13 9301 26797.142.79 126 10192.5115.28 252 10385.0315.28 822 5935.832.57 1481 5764.552.48 3534 22148.983.06 7318 190101.422.63
aat 27471 1782286.9018.61 41071 1738428.9418.15 0 00.000.00 0 00.000.00 5964 380259.9516.56 8422 370367.0916.13 20532 1334284.5618.49 30933 1301428.7218.03
acc 3489 25836.442.69 7190 23575.092.45 99 5151.267.64 309 5472.127.64 897 6839.102.96 1653 5672.052.44 2379 17632.972.44 5021 16569.592.29
acg 30 20.310.02 36 20.380.02 0 00.000.00 0 00.000.00 0 00.000.00 0 00.000.00 12 10.170.01 18 10.250.01
act 831 608.680.63 1286 5913.430.62 0 00.000.00 0 00.000.00 192 138.370.57 259 1211.290.52 576 427.980.58 949 4213.150.58
agc 3615 26037.762.71 4994 25252.162.63 348 24531.7136.67 755 191153.5629.03 972 6942.373.01 1333 6858.102.96 2130 15429.522.13 2656 15236.812.11
agg 5118 39553.454.12 11809 352123.333.68 99 8151.2612.22 222 8339.1912.22 1389 10760.544.66 3471 93151.294.05 3402 26247.153.63 7573 234104.963.24
atc 3798 24739.672.58 7248 23575.702.45 132 10201.6815.28 198 10302.5215.28 1059 6546.162.83 2469 60107.622.62 2514 16634.842.30 4437 15961.492.20
ccg 1512 9915.791.03 4167 9043.520.94 432 29660.0544.31 1577 222409.4833.61 66 42.880.17 75 43.270.17 606 398.400.54 1387 3819.220.53
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.9351.30156 / 10 acg, act, agc, ccg
number per megabase1.0000.06867 / 10 acg, act, ccg
Coding regions
length per megabase0.4904.42456 / 10 aac, aat, acg, act
number per megabase0.9910.82567 / 10 aat, acg, act
Introns
length per megabase0.8832.36567 / 10 acg, act, ccg
number per megabase1.0000.14467 / 10 acg, act, ccg
Intergenic regions
length per megabase0.9511.13756 / 10 acg, act, agc, ccg
number per megabase1.0000.08556 / 10 acg, act, agc, ccg