Abundance of perfect and imperfect trinucleotide repeats in human chromosome 11

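ALL

| Repeat | perfect TL | perfect TN | perfect LPMb | perfect NPMb | imperfect TL | imperfect TN | imperfect LPMb | imperfect NPMb |
|---|---|---|---|---|---|---|---|---|
| aac | 22620 | 1489 | 172.79 | 11.37 | 34554 | 1466 | 263.95 | 11.20 |
| aag | 7908 | 496 | 60.41 | 3.79 | 16855 | 427 | 128.75 | 3.26 |
| aat | 39039 | 2511 | 298.21 | 19.18 | 57138 | 2461 | 436.47 | 18.80 |
| acc | 6966 | 508 | 53.21 | 3.88 | 14575 | 447 | 111.34 | 3.42 |
| acg | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| act | 1233 | 90 | 9.42 | 0.69 | 2079 | 86 | 15.88 | 0.66 |
| agc | 6069 | 444 | 46.36 | 3.39 | 7714 | 437 | 58.93 | 3.34 |
| agg | 10365 | 782 | 79.18 | 5.97 | 20056 | 723 | 153.21 | 5.52 |
| atc | 7668 | 521 | 58.58 | 3.98 | 19524 | 444 | 149.14 | 3.39 |
| ccg | 2124 | 151 | 16.23 | 1.15 | 5788 | 145 | 44.21 | 1.11 |

CODING

| Repeat | perfect TL | perfect TN | perfect LPMb | perfect NPMb | imperfect TL | imperfect TN | imperfect LPMb | imperfect NPMb |
|---|---|---|---|---|---|---|---|---|
| aac | 12 | 1 | 6.05 | 0.50 | 12 | 1 | 6.05 | 0.50 |
| aag | 408 | 33 | 205.68 | 16.64 | 723 | 33 | 364.48 | 16.64 |
| aat | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| acc | 264 | 18 | 133.09 | 9.07 | 438 | 18 | 220.81 | 9.07 |
| acg | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| act | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| agc | 813 | 55 | 409.86 | 27.73 | 1213 | 50 | 611.51 | 25.21 |
| agg | 696 | 50 | 350.87 | 25.21 | 1325 | 48 | 667.97 | 24.20 |
| atc | 363 | 26 | 183.00 | 13.11 | 587 | 25 | 295.92 | 12.60 |
| ccg | 456 | 36 | 229.88 | 18.15 | 1056 | 35 | 532.36 | 17.64 |

INTRON

| Repeat | perfect TL | perfect TN | perfect LPMb | perfect NPMb | imperfect TL | imperfect TN | imperfect LPMb | imperfect NPMb |
|---|---|---|---|---|---|---|---|---|
| aac | 7818 | 508 | 178.79 | 11.62 | 11663 | 500 | 266.73 | 11.44 |
| aag | 2070 | 139 | 47.34 | 3.18 | 4069 | 130 | 93.06 | 2.97 |
| aat | 13398 | 847 | 306.40 | 19.37 | 19255 | 828 | 440.35 | 18.94 |
| acc | 2358 | 174 | 53.93 | 3.98 | 5233 | 148 | 119.68 | 3.38 |
| acg | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| act | 483 | 37 | 11.05 | 0.85 | 860 | 35 | 19.67 | 0.80 |
| agc | 1938 | 142 | 44.32 | 3.25 | 2380 | 141 | 54.43 | 3.23 |
| agg | 3534 | 269 | 80.82 | 6.15 | 7275 | 243 | 166.38 | 5.56 |
| atc | 2241 | 156 | 51.25 | 3.57 | 4971 | 145 | 113.68 | 3.32 |
| ccg | 282 | 21 | 6.45 | 0.48 | 915 | 20 | 20.93 | 0.46 |

INTERGENIC

| Repeat | perfect TL | perfect TN | perfect LPMb | perfect NPMb | imperfect TL | imperfect TN | imperfect LPMb | imperfect NPMb |
|---|---|---|---|---|---|---|---|---|
| aac | 13416 | 884 | 157.47 | 10.38 | 20766 | 871 | 243.74 | 10.22 |
| aag | 4875 | 284 | 57.22 | 3.33 | 10968 | 229 | 128.73 | 2.69 |
| aat | 22716 | 1480 | 266.62 | 17.37 | 33626 | 1452 | 394.68 | 17.04 |
| acc | 3471 | 257 | 40.74 | 3.02 | 7179 | 237 | 84.26 | 2.78 |
| acg | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| act | 648 | 45 | 7.61 | 0.53 | 1068 | 43 | 12.54 | 0.51 |
| agc | 2646 | 196 | 31.06 | 2.30 | 3275 | 195 | 38.44 | 2.29 |
| agg | 5118 | 386 | 60.07 | 4.53 | 9818 | 359 | 115.24 | 4.21 |
| atc | 4269 | 290 | 50.11 | 3.40 | 12267 | 233 | 143.98 | 2.73 |
| ccg | 729 | 51 | 8.56 | 0.60 | 2168 | 48 | 25.45 | 0.56 |
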
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were excluded iteratively until every remaining data point contributed more than 5% of the total. Data sets were then normalized to 100%.
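The exclusion and normalization steps can be sketched as follows. This is a minimal illustration, not the authors' code. It rests on three assumptions: the 5% rule is applied to the perfect and imperfect columns together, acg is left out beforehand because it is zero everywhere (which would explain the "/ 9" data-point count), and the contingency test is a Pearson chi-square on the normalized percentages. The function names `shares`, `exclude_small` and `chi_square` are hypothetical. The input values are the NPMb figures for all sequences from the abundance table.

```python
# NPMb values for "All sequences", copied from the abundance table.
# acg is omitted (assumption): it is zero everywhere, leaving 9 data points.
PERFECT = {"aac": 11.37, "aag": 3.79, "aat": 19.18, "acc": 3.88, "act": 0.69,
           "agc": 3.39, "agg": 5.97, "atc": 3.98, "ccg": 1.15}
IMPERFECT = {"aac": 11.20, "aag": 3.26, "aat": 18.80, "acc": 3.42, "act": 0.66,
             "agc": 3.34, "agg": 5.52, "atc": 3.39, "ccg": 1.11}


def shares(column, keys):
    """Return each kept class as a percentage of the kept total (sums to 100)."""
    total = sum(column[k] for k in keys)
    return {k: 100.0 * column[k] / total for k in keys}


def exclude_small(columns, threshold=5.0):
    """Drop the smallest class until every kept class exceeds the threshold
    in every column (assumed reading of the exclusion rule)."""
    keys = sorted(columns[0])
    excluded = []
    while True:
        pct = [shares(col, keys) for col in columns]
        # Smallest share of each class across the compared columns.
        worst = {k: min(p[k] for p in pct) for k in keys}
        smallest = min(keys, key=worst.get)
        if worst[smallest] > threshold:
            return keys, excluded, pct
        keys.remove(smallest)
        excluded.append(smallest)


def chi_square(rows):
    """Pearson chi-square statistic and degrees of freedom for a contingency table."""
    row_totals = [sum(r) for r in rows]
    col_totals = [sum(c) for c in zip(*rows)]
    grand = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(rows):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand
            stat += (observed - expected) ** 2 / expected
    dof = (len(rows) - 1) * (len(rows[0]) - 1)
    return stat, dof


kept, excluded, pct = exclude_small([PERFECT, IMPERFECT])
table = [[p[k] for k in kept] for p in pct]  # rows: perfect, imperfect
stat, dof = chi_square(table)
print(kept, excluded, round(stat, 3), dof)
```

Under these assumptions the sketch drops act and ccg, keeps 7 of 9 data points, and gives 6 degrees of freedom with a statistic close to the 0.139 reported for number per megabase in all sequences. Small differences are expected because the inputs are rounded to two decimals. The probability column is not computed here, since that needs a chi-square distribution function from outside the standard library.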

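| Sequence set | Feature | Probability | Chi-square | Degrees of freedom | Used / all data points | Excluded repeats |
|---|---|---|---|---|---|---|
| All sequences | length per megabase | 0.868 | 1.858 | 5 | 6 / 9 | act, agc, ccg |
| All sequences | number per megabase | 1.000 | 0.139 | 6 | 7 / 9 | act, ccg |
| Coding regions | length per megabase | 0.953 | 1.113 | 5 | 6 / 9 | aac, aat, act |
| Coding regions | number per megabase | 1.000 | 0.058 | 5 | 6 / 9 | aac, aat, act |
| Introns | length per megabase | 0.896 | 1.642 | 5 | 6 / 9 | act, agc, ccg |
| Introns | number per megabase | 1.000 | 0.097 | 6 | 7 / 9 | act, ccg |
| Intergenic regions | length per megabase | 0.789 | 2.413 | 5 | 6 / 9 | act, agc, ccg |
| Intergenic regions | number per megabase | 1.000 | 0.239 | 6 | 7 / 9 | act, ccg |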