Abundance of perfect and imperfect trinucleotide repeats in human chromosome 16

ALL         perfect                           imperfect
Repeat      TL      TN     LPMb      NPMb     TL      TN     LPMb      NPMb
aac         15876   1056   198.62    13.21    24819   1028   310.50    12.86
aag         6198    363    77.54     4.54     15136   287    189.36    3.59
aat         28314   1709   354.22    21.38    41405   1654   518.00    20.69
acc         5571    389    69.70     4.87     16744   312    209.48    3.90
acg         36      3      0.45      0.04     57      3      0.71      0.04
act         885     67     11.07     0.84     2636    48     32.98     0.60
agc         5514    396    68.98     4.95     7911    378    98.97     4.73
agg         10245   768    128.17    9.61     27780   621    347.54    7.77
atc         6456    443    80.77     5.54     24906   336    311.59    4.20
ccg         3465    232    43.35     2.90     9026    198    112.92    2.48

CODING      perfect                           imperfect
Repeat      TL      TN     LPMb      NPMb     TL      TN     LPMb      NPMb
aac         0       0      0.00      0.00     0       0      0.00      0.00
aag         114     9      77.23     6.10     288     9      195.10    6.10
aat         0       0      0.00      0.00     0       0      0.00      0.00
acc         327     25     221.53    16.94    593     25     401.73    16.94
acg         0       0      0.00      0.00     0       0      0.00      0.00
act         0       0      0.00      0.00     0       0      0.00      0.00
agc         1104    76     747.90    51.49    2334    64     1581.17   43.36
agg         681     48     461.34    32.52    1676    44     1135.40   29.81
atc         48      4      32.52     2.71     84      4      56.91     2.71
ccg         582     41     394.27    27.77    1349    38     913.88    25.74

INTRON      perfect                           imperfect
Repeat      TL      TN     LPMb      NPMb     TL      TN     LPMb      NPMb
aac         5139    348    186.25    12.61    7790    341    282.33    12.36
aag         1095    77     39.69     2.79     2795    72     101.30    2.61
aat         8910    522    322.93    18.92    13291   503    481.71    18.23
acc         1875    132    67.96     4.78     6227    106    225.69    3.84
acg         12      1      0.43      0.04     21      1      0.76      0.04
act         216     17     7.83      0.62     383     16     13.88     0.58
agc         1653    121    59.91     4.38     2057    118    74.55     4.28
agg         2907    216    105.36    7.83     6579    199    238.44    7.21
atc         1911    129    69.26     4.67     5625    116    203.87    4.20
ccg         426     29     15.44     1.05     1218    23     44.14     0.83

INTERGENIC  perfect                           imperfect
Repeat      TL      TN     LPMb      NPMb     TL      TN     LPMb      NPMb
aac         9087    599    178.65    11.78    14261   582    280.37    11.44
aag         4482    245    88.12     4.82     10600   181    208.40    3.56
aat         16641   1027   327.16    20.19    24160   993    474.98    19.52
acc         2760    186    54.26     3.66     8337    149    163.91    2.93
acg         24      2      0.47      0.04     36      2      0.71      0.04
act         618     46     12.15     0.90     2164    28     42.54     0.55
agc         2196    154    43.17     3.03     2865    151    56.33     2.97
agg         5661    426    111.30    8.38     17600   315    346.01    6.19
atc         3888    270    76.44     5.31     17092   190    336.03    3.73
ccg         1413    92     27.78     1.81     3983    75     78.31     1.47

TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase
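
The per-megabase columns follow from dividing a raw total by the length of the sequence class in megabases. The class lengths are not listed in the table, so the sketch below backs one out from a TL/LPMb pair and uses it to cross-check the matching NPMb value. The function names are illustrative, and the implied length (about 79.9 Mb for ALL) is an inference from the tabulated numbers, not a figure stated in the source.

```python
def per_megabase(value, region_bp):
    """Scale a raw total (length or count) to a per-megabase density."""
    return value / (region_bp / 1_000_000)


def implied_region_bp(total_length, length_per_mb):
    """Back out the region length in bp implied by a TL / LPMb pair."""
    return total_length / length_per_mb * 1_000_000


# ALL, perfect, aac: TL = 15876, TN = 1056, LPMb = 198.62 (values from the table)
region_bp = implied_region_bp(15876, 198.62)  # inferred, not given in the source
npmb = per_megabase(1056, region_bp)          # should agree with the tabulated NPMb
print(round(region_bp / 1_000_000, 2), round(npmb, 2))
```

The same check can be repeated for the CODING, INTRON and INTERGENIC columns, each of which implies its own region length.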

Statistical analysis (contingency test results)

Data points with the smallest value were excluded iteratively until each remaining data point contributed more than 5% of the total. Data sets were then normalized to 100%.

Feature               Probability  ChiSquare  Degrees of freedom  Used / all data points  Excluded repeats
All sequences
length per megabase   0.272        6.372      5                   6 / 10                  acg, act, agc, ccg
number per megabase   0.998        0.471      6                   7 / 10                  acg, act, ccg
Coding regions
length per megabase   0.933        0.434      3                   4 / 10                  aac, aag, aat, acg, act, atc
number per megabase   0.981        0.176      3                   4 / 10                  aac, aag, aat, acg, act, atc
Introns
length per megabase   0.305        4.833      4                   5 / 10                  aag, acg, act, agc, ccg
number per megabase   1.000        0.152      5                   6 / 10                  aag, acg, act, ccg
Intergenic regions
length per megabase   0.146        8.190      5                   6 / 10                  acg, act, agc, ccg
number per megabase   0.990        0.858      6                   7 / 10                  acg, act, ccg
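
A minimal sketch of the exclusion and normalization steps described in the note on the contingency test, followed by a Pearson chi-square statistic on the resulting two-row table. It assumes that perfect and imperfect repeats form the two rows of the contingency table and that a repeat class is dropped when it falls at or below 5% in either row. The source states neither assumption explicitly, and the function names are illustrative. The input values are the NPMb figures for ALL sequences from the abundance table.

```python
def exclude_small(perfect, imperfect, min_percent=5.0):
    """Drop the smallest class, one at a time, until every kept class
    contributes more than min_percent of the total in both data sets."""
    kept = sorted(perfect)
    excluded = []
    changed = True
    while changed and kept:
        changed = False
        for column in (perfect, imperfect):
            total = sum(column[k] for k in kept)
            smallest = min(kept, key=lambda k: column[k])
            if total == 0 or 100.0 * column[smallest] / total <= min_percent:
                kept.remove(smallest)
                excluded.append(smallest)
                changed = True
                break  # recompute totals before testing the next class
    return kept, sorted(excluded)


def normalize(column, kept):
    """Rescale the kept classes of one data set so they sum to 100."""
    total = sum(column[k] for k in kept)
    return [100.0 * column[k] / total for k in kept]


def chi_square(rows):
    """Pearson chi-square statistic and degrees of freedom for an r x c table."""
    row_totals = [sum(r) for r in rows]
    col_totals = [sum(c) for c in zip(*rows)]
    grand = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(rows):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand
            stat += (observed - expected) ** 2 / expected
    dof = (len(rows) - 1) * (len(col_totals) - 1)
    return stat, dof


# NPMb, ALL sequences (perfect and imperfect), copied from the abundance table
npmb_perfect = {"aac": 13.21, "aag": 4.54, "aat": 21.38, "acc": 4.87, "acg": 0.04,
                "act": 0.84, "agc": 4.95, "agg": 9.61, "atc": 5.54, "ccg": 2.90}
npmb_imperfect = {"aac": 12.86, "aag": 3.59, "aat": 20.69, "acc": 3.90, "acg": 0.04,
                  "act": 0.60, "agc": 4.73, "agg": 7.77, "atc": 4.20, "ccg": 2.48}

kept, excluded = exclude_small(npmb_perfect, npmb_imperfect)
stat, dof = chi_square([normalize(npmb_perfect, kept),
                        normalize(npmb_imperfect, kept)])
print(excluded, dof, round(stat, 2))
```

With these inputs the sketch excludes acg, act and ccg and yields 6 degrees of freedom, in line with the "All sequences, number per megabase" row, and a statistic of roughly 0.47, close to the tabulated 0.471; the small gap presumably comes from the two-decimal rounding of the inputs. A probability could then be obtained from the chi-square survival function, for example `scipy.stats.chi2.sf(stat, dof)`.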