Abundance of perfect and imperfect trinucleotide repeats in human chromosome 15

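ALL

| Repeat | Perfect TL | Perfect TN | Perfect LPMb | Perfect NPMb | Imperfect TL | Imperfect TN | Imperfect LPMb | Imperfect NPMb |
|---|---|---|---|---|---|---|---|---|
| aac | 15783 | 1044 | 192.49 | 12.73 | 23660 | 1026 | 288.56 | 12.51 |
| aag | 5577 | 341 | 68.02 | 4.16 | 15961 | 261 | 194.66 | 3.18 |
| aat | 22659 | 1429 | 276.36 | 17.43 | 33087 | 1380 | 403.54 | 16.83 |
| acc | 5925 | 421 | 72.26 | 5.13 | 13789 | 291 | 168.17 | 3.55 |
| acg | 36 | 3 | 0.44 | 0.04 | 57 | 3 | 0.69 | 0.04 |
| act | 1074 | 69 | 13.10 | 0.84 | 1633 | 65 | 19.92 | 0.79 |
| agc | 4383 | 319 | 53.46 | 3.89 | 5943 | 312 | 72.48 | 3.81 |
| agg | 7920 | 593 | 96.59 | 7.23 | 18163 | 523 | 221.52 | 6.38 |
| atc | 4434 | 294 | 54.08 | 3.59 | 7021 | 281 | 85.63 | 3.43 |
| ccg | 1881 | 128 | 22.94 | 1.56 | 6276 | 117 | 76.54 | 1.43 |

CODING

| Repeat | Perfect TL | Perfect TN | Perfect LPMb | Perfect NPMb | Imperfect TL | Imperfect TN | Imperfect LPMb | Imperfect NPMb |
|---|---|---|---|---|---|---|---|---|
| aac | 15 | 1 | 12.23 | 0.82 | 18 | 1 | 14.68 | 0.82 |
| aag | 135 | 11 | 110.11 | 8.97 | 221 | 11 | 180.26 | 8.97 |
| aat | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| acc | 153 | 12 | 124.79 | 9.79 | 231 | 11 | 188.41 | 8.97 |
| acg | 12 | 1 | 9.79 | 0.82 | 24 | 1 | 19.58 | 0.82 |
| act | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| agc | 600 | 40 | 489.39 | 32.63 | 974 | 39 | 794.44 | 31.81 |
| agg | 507 | 40 | 413.53 | 32.63 | 895 | 39 | 730.00 | 31.81 |
| atc | 129 | 10 | 105.22 | 8.16 | 234 | 10 | 190.86 | 8.16 |
| ccg | 345 | 24 | 281.40 | 19.58 | 1271 | 24 | 1036.69 | 19.58 |

INTRON

| Repeat | Perfect TL | Perfect TN | Perfect LPMb | Perfect NPMb | Imperfect TL | Imperfect TN | Imperfect LPMb | Imperfect NPMb |
|---|---|---|---|---|---|---|---|---|
| aac | 5505 | 362 | 193.22 | 12.71 | 8129 | 353 | 285.33 | 12.39 |
| aag | 1941 | 130 | 68.13 | 4.56 | 7249 | 89 | 254.44 | 3.12 |
| aat | 7737 | 494 | 271.57 | 17.34 | 11325 | 478 | 397.51 | 16.78 |
| acc | 1443 | 107 | 50.65 | 3.76 | 2169 | 103 | 76.13 | 3.62 |
| acg | 24 | 2 | 0.84 | 0.07 | 33 | 2 | 1.16 | 0.07 |
| act | 540 | 37 | 18.95 | 1.30 | 841 | 34 | 29.52 | 1.19 |
| agc | 1602 | 115 | 56.23 | 4.04 | 1978 | 114 | 69.43 | 4.00 |
| agg | 2613 | 198 | 91.72 | 6.95 | 5264 | 172 | 184.77 | 6.04 |
| atc | 1533 | 100 | 53.81 | 3.51 | 2273 | 95 | 79.78 | 3.33 |
| ccg | 156 | 11 | 5.48 | 0.39 | 525 | 11 | 18.43 | 0.39 |

INTERGENIC

| Repeat | Perfect TL | Perfect TN | Perfect LPMb | Perfect NPMb | Imperfect TL | Imperfect TN | Imperfect LPMb | Imperfect NPMb |
|---|---|---|---|---|---|---|---|---|
| aac | 8793 | 582 | 168.20 | 11.13 | 13373 | 575 | 255.81 | 11.00 |
| aag | 3117 | 177 | 59.62 | 3.39 | 7937 | 143 | 151.83 | 2.73 |
| aat | 12921 | 810 | 247.17 | 15.49 | 19026 | 782 | 363.95 | 14.96 |
| acc | 3933 | 272 | 75.23 | 5.20 | 10883 | 147 | 208.18 | 2.81 |
| acg | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| act | 444 | 25 | 8.49 | 0.48 | 613 | 24 | 11.73 | 0.46 |
| agc | 1746 | 133 | 33.40 | 2.54 | 2051 | 133 | 39.23 | 2.54 |
| agg | 4137 | 304 | 79.14 | 5.82 | 10843 | 266 | 207.42 | 5.09 |
| atc | 2349 | 155 | 44.93 | 2.96 | 3879 | 148 | 74.20 | 2.83 |
| ccg | 885 | 60 | 16.93 | 1.15 | 3094 | 51 | 59.19 | 0.98 |
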
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase
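
The per-megabase columns are the raw totals divided by the size of the sequence class in megabases. The class sizes are not stated here, so the sketch below back-calculates one from a reported TL and LPMb pair (an assumption made for illustration, not a figure from the source) and checks that the same size is consistent with the reported NPMb. The names `per_megabase` and `implied_region_mb` are hypothetical.

```python
def per_megabase(total: float, region_mb: float) -> float:
    """Scale a raw total (a length or a count) to a per-megabase value."""
    if region_mb <= 0:
        raise ValueError("region size must be positive")
    return total / region_mb


# aac, ALL sequences, perfect repeats (TL, TN and LPMb as reported in the table).
tl, tn, lpmb = 15783, 1044, 192.49

# Assumption: the region size is inferred from TL / LPMb, not given in the source.
implied_region_mb = tl / lpmb

# Using the inferred size, recompute the number of repeats per megabase.
npmb = per_megabase(tn, implied_region_mb)

print(f"implied region size: {implied_region_mb:.2f} Mb")
print(f"recomputed NPMb: {npmb:.2f}")
```

The recomputed NPMb can be compared with the 12.73 reported for the same row.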

Statistical analysis (contingency test results)

Data points with the smallest values were excluded iteratively until every remaining data point contributed more than 5% of the total. Data sets were then normalized to 100%.
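
A minimal sketch of one reading of this filtering step, in Python. It assumes that the perfect and imperfect series are filtered jointly: at each round the repeat class with the smallest percentage share in either series is dropped from both, until every remaining share exceeds 5%. That joint rule and the function names are assumptions, not taken from the source. The input values are the ALL-sequences LPMb columns.

```python
# LPMb for ALL sequences, perfect and imperfect repeats.
PERFECT = {"aac": 192.49, "aag": 68.02, "aat": 276.36, "acc": 72.26, "acg": 0.44,
           "act": 13.10, "agc": 53.46, "agg": 96.59, "atc": 54.08, "ccg": 22.94}
IMPERFECT = {"aac": 288.56, "aag": 194.66, "aat": 403.54, "acc": 168.17, "acg": 0.69,
             "act": 19.92, "agc": 72.48, "agg": 221.52, "atc": 85.63, "ccg": 76.54}


def percent_shares(values, keys):
    """Normalize the selected values so that they sum to 100%."""
    total = sum(values[k] for k in keys)
    return {k: 100.0 * values[k] / total for k in keys}


def exclude_small(perfect, imperfect, threshold_pct=5.0):
    """Drop the class with the smallest share until all shares exceed the threshold."""
    kept = sorted(perfect)  # both series are assumed to share the same keys
    excluded = []
    while len(kept) > 1:
        candidates = []
        for series in (perfect, imperfect):
            shares = percent_shares(series, kept)
            candidates.extend((share, repeat) for repeat, share in shares.items())
        smallest_share, repeat = min(candidates)
        if smallest_share > threshold_pct:
            break  # every remaining class contributes more than the threshold
        kept.remove(repeat)
        excluded.append(repeat)
    return kept, excluded


kept, excluded = exclude_small(PERFECT, IMPERFECT)
perfect_pct = percent_shares(PERFECT, kept)      # normalized to 100%
imperfect_pct = percent_shares(IMPERFECT, kept)  # normalized to 100%
print("excluded:", sorted(excluded))
print("kept:", kept)
```

Under this reading, the classes dropped for the ALL-sequences LPMb values are acg, act and ccg, leaving 7 of 10 data points.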

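| Sequence set | Feature | Probability | Chi-square | Degrees of freedom | Used / all data points | Excluded repeats |
|---|---|---|---|---|---|---|
| All sequences | length per megabase | 0.788 | 3.167 | 6 | 7 / 10 | acg, act, ccg |
| All sequences | number per megabase | 0.997 | 0.576 | 6 | 7 / 10 | acg, act, ccg |
| Coding regions | length per megabase | 0.327 | 5.792 | 5 | 6 / 10 | aac, aat, acg, act |
| Coding regions | number per megabase | 1.000 | 0.026 | 5 | 6 / 10 | aac, aat, acg, act |
| Introns | length per megabase | 0.537 | 5.051 | 6 | 7 / 10 | acg, act, ccg |
| Introns | number per megabase | 0.998 | 0.451 | 6 | 7 / 10 | acg, act, ccg |
| Intergenic regions | length per megabase | 0.583 | 3.767 | 5 | 6 / 10 | acg, act, agc, ccg |
| Intergenic regions | number per megabase | 0.966 | 1.395 | 6 | 7 / 10 | acg, act, ccg |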
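
As an illustration of how one row of these results could be recomputed, the sketch below applies a Pearson chi-square test to a 2 x k contingency table built from the perfect and imperfect LPMb values for all sequences, after leaving out acg, act and ccg and normalizing each series to 100%. Testing the normalized percentages directly is an assumption about the method, and the function names are hypothetical. The closed-form tail probability used here is valid only for an even number of degrees of freedom.

```python
import math

# ALL sequences, LPMb, for the seven retained classes:
# aac, aag, aat, acc, agc, agg, atc
perfect = [192.49, 68.02, 276.36, 72.26, 53.46, 96.59, 54.08]
imperfect = [288.56, 194.66, 403.54, 168.17, 72.48, 221.52, 85.63]


def to_percent(values):
    """Normalize a series so that it sums to 100%."""
    total = sum(values)
    return [100.0 * v / total for v in values]


def chi_square_two_rows(row_a, row_b):
    """Pearson chi-square statistic and degrees of freedom for a 2 x k table."""
    total_a, total_b = sum(row_a), sum(row_b)
    grand = total_a + total_b
    stat = 0.0
    for a, b in zip(row_a, row_b):
        column = a + b
        for observed, row_total in ((a, total_a), (b, total_b)):
            expected = row_total * column / grand
            stat += (observed - expected) ** 2 / expected
    return stat, len(row_a) - 1  # (2 - 1) * (k - 1)


def chi_square_sf_even_df(stat, df):
    """Upper-tail probability of the chi-square distribution (even df only)."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("closed form requires a positive, even df")
    half = stat / 2.0
    return math.exp(-half) * sum(half ** k / math.factorial(k) for k in range(df // 2))


stat, df = chi_square_two_rows(to_percent(perfect), to_percent(imperfect))
prob = chi_square_sf_even_df(stat, df)
print(f"chi-square = {stat:.3f}, df = {df}, probability = {prob:.3f}")
```

The output can be compared with the "All sequences, length per megabase" row (chi-square 3.167, 6 degrees of freedom, probability 0.788).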