Abundance of perfect and imperfect trinucleotide repeats in human chromosome 10

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 24300 1576184.0711.94 36051 1542273.0811.68 12 18.850.74 16 111.800.74 8625 566178.1211.69 12642 550261.0711.36 14337 923174.3411.22 21346 906259.5611.02
aag 9012 55268.264.18 18308 457138.683.46 252 19185.9114.02 537 19396.1714.02 2820 17358.243.57 5762 139118.992.87 5592 33368.004.05 11160 277135.703.37
aat 38247 2404289.7118.21 55586 2346421.0517.77 15 111.070.74 15 111.070.74 12855 830265.4717.14 18747 819387.1516.91 23304 1439283.3717.50 33860 1396411.7316.98
acc 5844 44044.273.33 13577 390102.842.95 282 20208.0414.76 450 17331.9912.54 2214 16345.723.37 5636 142116.392.93 2913 22535.422.74 6705 19981.532.42
acg 42 30.320.02 116 30.880.02 42 330.982.21 116 385.582.21 0 00.000.00 0 00.000.00 0 00.000.00 0 00.000.00
act 1401 9710.610.73 2096 9315.880.70 0 00.000.00 0 00.000.00 744 4915.371.01 1041 4521.500.93 585 437.110.52 900 4310.940.52
agc 5661 43342.883.28 7083 42953.653.25 426 30314.2822.13 667 29492.0821.39 1956 15040.393.10 2257 15046.613.10 2853 22134.692.69 3564 21843.342.65
agg 10464 77979.265.90 21409 670162.175.08 447 33329.7724.35 801 30590.9322.13 3720 28176.825.80 8167 239168.664.94 5580 41367.855.02 11252 357136.824.34
atc 7617 48357.703.66 17478 431132.393.27 177 14130.5810.33 261 11192.558.12 3414 21670.504.46 7964 192164.473.96 3492 22442.462.72 8438 203102.602.47
ccg 3012 20322.821.54 8982 18568.041.40 621 44458.1432.46 1633 401204.7429.51 333 226.880.45 971 2120.050.43 1176 7814.300.95 3954 7048.080.85
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.8751.80756 / 10 acg, act, agc, ccg
number per megabase1.0000.18267 / 10 acg, act, ccg
Coding regions
length per megabase0.7812.47356 / 10 aac, aat, acg, act
number per megabase0.9990.20656 / 10 aac, aat, acg, act
Introns
length per megabase0.7902.40956 / 10 acg, act, agc, ccg
number per megabase1.0000.24067 / 10 acg, act, ccg
Intergenic regions
length per megabase0.8841.74256 / 10 acg, act, agc, ccg
number per megabase1.0000.16367 / 10 acg, act, ccg