Abundance of perfect and imperfect trinucleotide repeats in human chromosome 12

ALL
        perfect                      imperfect
Repeat  TL     TN    LPMb     NPMb   TL     TN    LPMb     NPMb
aac     26040  1665  200.58   12.82  38638  1625  297.61   12.52
aag     8436   534   64.98    4.11   18803  445   144.83   3.43
aat     42831  2711  329.91   20.88  62706  2636  483.00   20.30
acc     6984   506   53.80    3.90   15257  430   117.52   3.31
acg     12     1     0.09     0.01   21     1     0.16     0.01
act     1188   83    9.15     0.64   1754   81    13.51    0.62
agc     5502   404   42.38    3.11   7968   390   61.37    3.00
agg     10917  812   84.09    6.25   24927  666   192.00   5.13
atc     8196   551   63.13    4.24   29559  426   227.68   3.28
ccg     2469   169   19.02    1.30   7007   156   53.97    1.20

CODING
        perfect                      imperfect
Repeat  TL     TN    LPMb     NPMb   TL     TN    LPMb     NPMb
aac     12     1     6.94     0.58   12     1     6.94     0.58
aag     369    27    213.31   15.61  731    27    422.57   15.61
aat     27     2     15.61    1.16   34     2     19.65    1.16
acc     198    15    114.46   8.67   438    12    253.19   6.94
acg     12     1     6.94     0.58   21     1     12.14    0.58
act     0      0     0.00     0.00   0      0     0.00     0.00
agc     1035   63    598.30   36.42  2236   52    1292.56  30.06
agg     660    46    381.52   26.59  1278   46    738.77   26.59
atc     183    15    105.79   8.67   342    15    197.70   8.67
ccg     363    25    209.84   14.45  1071   22    619.11   12.72

INTRON
        perfect                      imperfect
Repeat  TL     TN    LPMb     NPMb   TL     TN    LPMb     NPMb
aac     8985   572   205.25   13.07  13212  555   301.81   12.68
aag     1842   131   42.08    2.99   4020   120   91.83    2.74
aat     13842  858   316.21   19.60  20320  832   464.19   19.01
acc     2247   162   51.33    3.70   4004   147   91.47    3.36
acg     0      0     0.00     0.00   0      0     0.00     0.00
act     489    35    11.17    0.80   799    34    18.25    0.78
agc     1545   118   35.29    2.70   1860   116   42.49    2.65
agg     3615   267   82.58    6.10   7203   232   164.54   5.30
atc     2889   193   66.00    4.41   8969   146   204.89   3.33
ccg     342    25    7.81     0.57   829    25    18.94    0.57

INTERGENIC
        perfect                      imperfect
Repeat  TL     TN    LPMb     NPMb   TL     TN    LPMb     NPMb
aac     15252  977   180.88   11.59  22781  956   270.17   11.34
aag     5217   325   61.87    3.85   11936  259   141.55   3.07
aat     25506  1629  302.49   19.32  37250  1588  441.76   18.83
acc     3972   288   47.11    3.42   9938   230   117.86   2.73
acg     0      0     0.00     0.00   0      0     0.00     0.00
act     651    44    7.72     0.52   904    43    10.72    0.51
agc     2430   186   28.82    2.21   3206   185   38.02    2.19
agg     5514   414   65.39    4.91   13588  326   161.15   3.87
atc     4512   305   53.51    3.62   19447  228   230.63   2.70
ccg     1128   75    13.38    0.89   3285   66    38.96    0.78

TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase
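
The per-megabase columns are densities, i.e. a total divided by the size of the region in megabases. The Python sketch below illustrates that relationship for the aac row (perfect repeats, ALL sequences). The region size is not stated in this section, so it is back-calculated here from TL and LPMb; that value and the helper name per_megabase are assumptions made for illustration only.

```python
def per_megabase(value, region_size_mb):
    """Scale a total (length in bp, or a repeat count) to a per-megabase density."""
    return value / region_size_mb


# aac, perfect repeats, ALL sequences (values from the table above)
tl, tn, lpmb = 26040, 1665, 200.58

# Assumption: the region size is not given in this section, so it is
# back-calculated from TL / LPMb rather than taken from the source.
implied_size_mb = tl / lpmb

# NPMb recomputed from TN and the implied region size
npmb = per_megabase(tn, implied_size_mb)
print(implied_size_mb, npmb)
```

The recomputed NPMb should agree with the tabulated 12.82 to within rounding, which is a quick consistency check on any row of the table.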

Statistical analysis (contingency test results)

Data points with the smallest value were excluded iteratively until each remaining data point contributed more than 5% of the total. Data sets were normalized to 100%.
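
The filtering and test can be sketched in Python as follows. This is one possible reading of the procedure, not a confirmed implementation. It assumes that a repeat class is dropped when its percentage share falls to 5% or below in either the perfect or the imperfect data set, that the class with the smallest share is dropped first, and that the test is a Pearson chi-square on the 2 x k table of normalized percentages. The function names are hypothetical. The input values are the LPMb figures for ALL sequences from the abundance table.

```python
# Length per megabase (LPMb) for ALL sequences, from the abundance table.
PERFECT = {"aac": 200.58, "aag": 64.98, "aat": 329.91, "acc": 53.80, "acg": 0.09,
           "act": 9.15, "agc": 42.38, "agg": 84.09, "atc": 63.13, "ccg": 19.02}
IMPERFECT = {"aac": 297.61, "aag": 144.83, "aat": 483.00, "acc": 117.52, "acg": 0.16,
             "act": 13.51, "agc": 61.37, "agg": 192.00, "atc": 227.68, "ccg": 53.97}


def percent(values, classes):
    """Normalize the selected classes so that they sum to 100%."""
    total = sum(values[c] for c in classes)
    return {c: 100.0 * values[c] / total for c in classes}


def contingency_test(perfect, imperfect, threshold=5.0):
    """Hypothetical helper: drop small classes, then compute chi-square."""
    classes = sorted(perfect)
    excluded = []
    while len(classes) > 1:
        p, q = percent(perfect, classes), percent(imperfect, classes)
        # Assumption: a class is judged by its smaller share across both data sets.
        worst = min(classes, key=lambda c: min(p[c], q[c]))
        if min(p[worst], q[worst]) > threshold:
            break  # every remaining class contributes more than the threshold
        classes.remove(worst)
        excluded.append(worst)

    # Re-normalize the surviving classes and compare the two distributions.
    p, q = percent(perfect, classes), percent(imperfect, classes)
    chi2 = 0.0
    for c in classes:
        expected = (p[c] + q[c]) / 2.0  # both rows sum to 100, so expected is the mean
        chi2 += (p[c] - expected) ** 2 / expected + (q[c] - expected) ** 2 / expected
    return sorted(excluded), chi2, len(classes) - 1


excluded, chi2, dof = contingency_test(PERFECT, IMPERFECT)
print(excluded, chi2, dof)
```

Under these assumptions the sketch excludes acg, act, agc and ccg and gives a chi-square of about 4.5 with 5 degrees of freedom, close to the values reported for length per megabase in all sequences. The probability would then follow from the chi-square distribution, for example with scipy.stats.chi2.sf(chi2, dof). The sketch does not handle a data set whose values are all zero.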

Feature               Probability  ChiSquare  Degrees of freedom  Used / all data points  Excluded repeats
All sequences
length per megabase   0.479        4.504      5                   6 / 10                  acg, act, agc, ccg
number per megabase   0.999        0.340      6                   7 / 10                  acg, act, ccg
Coding regions
length per megabase   0.965        0.974      5                   6 / 10                  aac, aat, acg, act
number per megabase   0.994        0.432      5                   6 / 10                  aac, aat, acg, act
Introns
length per megabase   0.687        3.081      5                   6 / 10                  acg, act, agc, ccg
number per megabase   1.000        0.253      6                   7 / 10                  acg, act, ccg
Intergenic regions
length per megabase   0.267        6.420      5                   6 / 10                  acg, act, agc, ccg
number per megabase   0.991        0.533      5                   6 / 10                  acg, act, agc, ccg