Abundance of perfect and imperfect trinucleotide repeats in human chromosome 19

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 12510 828223.8814.82 19572 807350.2614.44 12 15.600.47 12 15.600.47 4401 285222.4414.40 6606 281333.8914.20 6978 468205.5413.79 11230 451330.7813.28
aag 4731 28984.675.17 13086 241234.194.31 333 26155.3712.13 648 26302.3412.13 1335 8667.484.35 3214 77162.453.89 2667 15578.564.57 8101 121238.613.56
aat 24507 1415438.5825.32 35513 1362635.5424.37 0 00.000.00 0 00.000.00 9093 521459.6026.33 12835 495648.7325.02 12834 753378.0222.18 19155 730564.2121.50
acc 3687 26565.984.74 6979 224124.904.01 222 16103.587.46 379 16176.837.46 1695 11685.675.86 3363 84169.984.25 1323 10038.972.94 2177 9764.122.86
acg 30 20.540.04 39 20.700.04 15 17.000.47 21 19.800.47 0 00.000.00 0 00.000.00 0 00.000.00 0 00.000.00
act 261 204.670.36 367 206.570.36 0 00.000.00 0 00.000.00 150 117.580.56 213 1110.770.56 84 72.470.21 127 73.740.21
agc 3456 24961.854.46 4714 24484.364.37 1230 85573.8839.66 1875 81874.8137.79 786 6039.733.03 931 6047.063.03 969 7328.542.15 1244 7236.642.12
agg 10023 730179.3713.06 38356 556686.429.95 1428 100666.2646.66 3359 901567.2041.99 3483 257176.0412.99 13193 201666.8310.16 3843 288113.198.48 19241 201566.745.92
atc 3525 23763.084.24 16105 162288.212.90 180 1183.985.13 306 11142.775.13 912 5946.102.98 2438 46123.232.33 1572 10846.303.18 6466 75190.462.21
ccg 3792 25867.864.62 14501 239259.514.28 1044 73487.1034.06 4070 691898.9332.19 444 2922.441.47 1549 2578.291.26 1488 10143.832.98 6472 92190.632.71
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.05810.67856 / 10 acc, acg, act, agc
number per megabase0.9990.43067 / 10 acg, act, atc
Coding regions
length per megabase0.0956.37334 / 10 aac, aat, acc, acg, act, atc
number per megabase1.0000.06245 / 10 aac, aat, acg, act, atc
Introns
length per megabase0.0898.07245 / 10 acg, act, agc, atc, ccg
number per megabase0.9690.54145 / 10 acg, act, agc, atc, ccg
Intergenic regions
length per megabase0.02512.85656 / 10 acc, acg, act, agc
number per megabase0.9830.70356 / 10 acg, act, agc, atc