Abundance of perfect and imperfect trinucleotide repeats in human chromosome 6

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 32199 2084190.2912.32 47423 2037280.2612.04 12 17.150.59 15 18.930.59 10239 661200.4112.94 14682 647287.3812.66 20517 1327176.2011.40 30438 1294261.4011.11
aag 11649 69868.844.12 25168 554148.743.27 195 16116.129.53 459 16273.339.53 2568 17150.273.35 5522 146108.082.86 8331 47871.554.11 17856 361153.343.10
aat 49332 3147291.5418.60 72060 3059425.8618.08 36 321.441.79 60 335.731.79 14499 926283.8018.12 20886 906408.8117.73 31923 2037274.1517.49 46826 1973402.1316.94
acc 7332 54743.333.23 14596 48986.262.89 219 16130.419.53 428 13254.877.74 1953 14638.232.86 3108 14060.832.74 4530 34038.902.92 9847 29684.562.54
acg 99 80.580.05 105 80.620.05 15 18.930.59 15 18.930.59 60 51.170.10 60 51.170.10 12 10.100.01 18 10.150.01
act 1935 14011.440.83 3179 12718.790.75 0 00.000.00 0 00.000.00 642 4312.570.84 1056 3420.670.67 1125 849.660.72 1888 8016.210.69
agc 6609 48539.062.87 10396 47461.442.80 945 59562.7435.13 1707 531016.5031.56 1884 14136.882.76 2415 13947.272.72 3228 24527.722.10 5547 24447.642.10
agg 11112 82865.674.89 22366 747132.184.42 657 49391.2429.18 1455 47866.4427.99 3033 23059.374.50 5661 214110.814.19 6594 48656.634.17 13613 427116.913.67
atc 7374 50243.582.97 14968 46988.462.77 174 14103.618.34 330 14196.518.34 2319 16145.393.15 3421 15766.963.07 4497 30138.622.58 10672 27391.652.34
ccg 2811 19216.611.14 7794 17146.061.01 843 57502.0033.94 2014 491199.3229.18 231 164.520.31 520 1610.180.31 1047 708.990.60 3134 6126.910.52
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.9731.26667 / 10 acg, act, ccg
number per megabase1.0000.17167 / 10 acg, act, ccg
Coding regions
length per megabase0.9830.69456 / 10 aac, aat, acg, act
number per megabase0.9990.19056 / 10 aac, aat, acg, act
Introns
length per megabase0.9750.83756 / 10 acg, act, agc, ccg
number per megabase1.0000.06467 / 10 acg, act, ccg
Intergenic regions
length per megabase0.8821.75256 / 10 acg, act, agc, ccg
number per megabase0.9980.28456 / 10 acg, act, agc, ccg