Abundance of perfect and imperfect trinucleotide repeats in human chromosome 22

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 6363 426185.2312.40 9797 415285.1912.08 0 00.000.00 0 00.000.00 2505 168187.6412.58 3830 164286.8912.29 3324 224164.2111.07 5088 218251.3610.77
aag 2331 13367.863.87 4859 99141.452.88 60 578.936.58 87 5114.456.58 687 4051.463.00 1515 34113.482.55 1458 7972.033.90 3034 53149.892.62
aat 10578 629307.9318.31 14957 604435.4017.58 0 00.000.00 0 00.000.00 3780 225283.1516.85 5423 214406.2216.03 5967 352294.7817.39 8376 341413.7916.85
acc 3366 23297.986.75 7874 142229.224.13 114 9149.9711.84 165 9217.0611.84 1008 7875.515.84 3662 66274.314.94 2115 135104.496.67 3900 57192.672.82
acg 12 10.350.03 12 10.350.03 12 115.791.32 12 115.791.32 0 00.000.00 0 00.000.00 0 00.000.00 0 00.000.00
act 96 62.790.17 151 64.400.17 0 00.000.00 0 00.000.00 12 10.900.07 33 12.470.07 84 54.150.25 118 55.830.25
agc 2799 20581.485.97 4081 196118.805.71 714 46939.2960.52 1538 382023.3049.99 825 5961.804.42 1011 5875.734.34 1038 8251.284.05 1226 8260.574.05
agg 5079 380147.8511.06 12020 327349.919.52 363 26477.5434.20 709 24932.7231.57 1509 114113.038.54 3179 106238.137.94 2661 199131.469.83 6805 160336.187.90
atc 2571 17974.845.21 11985 124348.893.61 66 586.836.58 84 5110.506.58 882 6066.074.49 3795 49284.273.67 1500 10574.105.19 7931 61391.813.01
ccg 1356 9439.472.74 3757 84109.372.44 207 16272.3221.05 551 13724.8617.10 183 1313.710.97 445 1233.330.90 480 3323.711.63 1546 3076.381.48
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.3147.07467 / 10 acg, act, ccg
number per megabase0.9801.12767 / 10 acg, act, ccg
Coding regions
length per megabase0.8001.00734 / 10 aac, aag, aat, acg, act, atc
number per megabase0.9700.24334 / 10 aac, aag, aat, acg, act, atc
Introns
length per megabase0.1448.22856 / 10 acg, act, agc, ccg
number per megabase0.9990.16456 / 10 aag, acg, act, ccg
Intergenic regions
length per megabase0.0949.39156 / 10 acg, act, agc, ccg
number per megabase0.7783.24167 / 10 acg, act, ccg