Abundance of perfect and imperfect trinucleotide repeats in human chromosome X

ALL
        perfect                        imperfect
repeat  TL      TN    LPMb     NPMb    TL      TN    LPMb     NPMb
aac     30240   1944  198.79   12.78   44931   1916  295.37   12.60
aag     10560   628   69.42    4.13    23081   486   151.73   3.19
aat     48177   3047  316.71   20.03   72390   2959  475.88   19.45
acc     7221    524   47.47    3.44    10679   513   70.20    3.37
acg     102     8     0.67     0.05    162     7     1.06     0.05
act     2256    148   14.83    0.97    3867    140   25.42    0.92
agc     6453    456   42.42    3.00    8846    433   58.15    2.85
agg     10434   746   68.59    4.90    26285   579   172.79   3.81
atc     7383    488   48.53    3.21    16704   431   109.81   2.83
ccg     2742    174   18.02    1.14    7589    158   49.89    1.04

CODING
        perfect                        imperfect
repeat  TL      TN    LPMb     NPMb    TL      TN    LPMb     NPMb
aac     12      1     9.17     0.76    12      1     9.17     0.76
aag     270     21    206.30   16.05   715     21    546.32   16.05
aat     24      2     18.34    1.53    24      2     18.34    1.53
acc     174     13    132.95   9.93    316     13    241.45   9.93
acg     24      2     18.34    1.53    60      2     45.84    1.53
act     12      1     9.17     0.76    12      1     9.17     0.76
agc     669     41    511.17   31.33   1128    31    861.89   23.69
agg     693     49    529.51   37.44   1389    43    1061.31  32.86
atc     198     15    151.29   11.46   549     15    419.48   11.46
ccg     690     44    527.22   33.62   1547    43    1182.04  32.86

INTRON
        perfect                        imperfect
repeat  TL      TN    LPMb     NPMb    TL      TN    LPMb     NPMb
aac     7059    457   198.03   12.82   10378   450   291.14   12.62
aag     2010    133   56.39    3.73    5484    94    153.84   2.64
aat     11106   696   311.56   19.52   16386   671   459.68   18.82
acc     1965    141   55.12    3.96    2657    140   74.54    3.93
acg     0       0     0.00     0.00    0       0     0.00     0.00
act     807     53    22.64    1.49    1307    48    36.66    1.35
agc     1458    102   40.90    2.86    1905    93    53.44    2.61
agg     2490    176   69.85    4.94    6178    136   173.31   3.81
atc     1830    120   51.34    3.37    3427    112   96.14    3.14
ccg     249     18    6.99     0.51    1165    17    32.68    0.48

INTERGENIC
        perfect                        imperfect
repeat  TL      TN    LPMb     NPMb    TL      TN    LPMb     NPMb
aac     21222   1361  184.28   11.82   31803   1341  276.15   11.64
aag     7623    436   66.19    3.79    14649   341   127.20   2.96
aat     34029   2159  295.48   18.75   51395   2103  446.28   18.26
acc     4539    330   39.41    2.87    6504    322   56.48    2.80
acg     27      2     0.23     0.02    45      2     0.39     0.02
act     1311    84    11.38    0.73    2348    81    20.39    0.70
agc     3741    271   32.48    2.35    4739    269   41.15    2.34
agg     6171    444   53.59    3.85    16537   341   143.60   2.96
atc     4989    327   43.32    2.84    12179   281   105.75   2.44
ccg     996     62    8.65     0.54    2663    55    23.12    0.48

TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase
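
As a quick consistency check on these definitions, dividing TL by LPMb and TN by NPMb should both give the same sequence length in megabases. The following is a minimal sketch in Python using the aac perfect-repeat values for all sequences from the table above. The variable names are illustrative, and the implied length is only what these rounded values suggest, not a figure stated in the source.

```python
# aac, perfect repeats, all sequences (values copied from the table above)
tl, tn = 30240, 1944          # total length, total number
lpmb, npmb = 198.79, 12.78    # length and number per megabase

# Both ratios estimate the analysed sequence length in megabases.
mb_from_length = tl / lpmb
mb_from_count = tn / npmb

print(f"{mb_from_length:.1f} Mb vs {mb_from_count:.1f} Mb")
```

The same check can be repeated for any row and region to confirm that the per-megabase columns were split correctly from the counts.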

Statistical analysis (contingency test results)

Data points with the smallest value were excluded iteratively until every remaining data point contributed more than 5% of the total. Data sets were normalized to 100%.

Feature                Probability  ChiSquare  Degrees of freedom  Used / all data points  Excluded repeats
All sequences
  length per megabase  0.839        2.074      5                   6 / 10                  acg, act, agc, ccg
  number per megabase  0.999        0.362      6                   7 / 10                  acg, act, ccg
Coding regions
  length per megabase  0.934        1.311      5                   6 / 10                  aac, aat, acg, act
  number per megabase  0.990        0.556      5                   6 / 10                  aac, aat, acg, act
Introns
  length per megabase  0.740        2.743      5                   6 / 10                  acg, act, agc, ccg
  number per megabase  0.998        0.475      6                   7 / 10                  acg, act, ccg
Intergenic regions
  length per megabase  0.700        2.193      4                   5 / 10                  acc, acg, act, agc, ccg
  number per megabase  0.999        0.371      6                   7 / 10                  acg, act, ccg
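
The exclusion and normalization procedure described above can be sketched in Python as follows. This is one possible reading, not the authors' code. It assumes that perfect and imperfect repeats are the two data sets being compared, that a repeat class is dropped from both sets as soon as its share in either set is 5% or less, and that the test is a standard chi-square contingency test on the normalized percentages. The function names are hypothetical. The input is the "All sequences, length per megabase" data from the abundance table.

```python
# LPMb values for all sequences, copied from the abundance table.
perfect = {"aac": 198.79, "aag": 69.42, "aat": 316.71, "acc": 47.47,
           "acg": 0.67, "act": 14.83, "agc": 42.42, "agg": 68.59,
           "atc": 48.53, "ccg": 18.02}
imperfect = {"aac": 295.37, "aag": 151.73, "aat": 475.88, "acc": 70.20,
             "acg": 1.06, "act": 25.42, "agc": 58.15, "agg": 172.79,
             "atc": 109.81, "ccg": 49.89}


def percentages(values, keys):
    """Normalize the selected classes so that they sum to 100%."""
    total = sum(values[k] for k in keys)
    return {k: 100.0 * values[k] / total for k in keys}


def filter_small_classes(a, b, threshold=5.0):
    """Drop the smallest class until every share exceeds the threshold.

    Assumption: a class is removed from both data sets together.
    """
    keys = sorted(a)
    excluded = []
    while len(keys) > 1:
        pa, pb = percentages(a, keys), percentages(b, keys)
        smallest = min(keys, key=lambda k: min(pa[k], pb[k]))
        if min(pa[smallest], pb[smallest]) > threshold:
            break
        keys.remove(smallest)
        excluded.append(smallest)
    return keys, sorted(excluded)


def chi_square(a, b, keys):
    """Chi-square statistic and degrees of freedom for a 2 x k table."""
    rows = [percentages(a, keys), percentages(b, keys)]
    grand = sum(sum(r.values()) for r in rows)
    stat = 0.0
    for k in keys:
        column_total = sum(r[k] for r in rows)
        for r in rows:
            expected = sum(r.values()) * column_total / grand
            stat += (r[k] - expected) ** 2 / expected
    return stat, (len(rows) - 1) * (len(keys) - 1)


kept, excluded = filter_small_classes(perfect, imperfect)
chi2, dof = chi_square(perfect, imperfect, kept)
print(excluded, round(chi2, 3), dof)
```

Under these assumptions the sketch excludes acg, act, agc and ccg and gives a chi-square of about 2.07 with 5 degrees of freedom, in line with the first row of the table. The probability column would then follow from the chi-square distribution, for example with `scipy.stats.chi2.sf(chi2, dof)` if SciPy is available.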