Abundance of perfect and imperfect trinucleotide repeats in human chromosome 4

ALL
           perfect                        imperfect
Repeat     TL    TN     LPMb   NPMb       TL    TN     LPMb   NPMb
aac     33318  2228   177.94  11.90    49576  2191   264.77  11.70
aag     11175   715    59.68   3.82    24527   593   130.99   3.17
aat     57585  3755   307.55  20.05    84832  3678   453.07  19.64
acc      8514   620    45.47   3.31    18263   488    97.54   2.61
acg        24     2     0.13   0.01       27     2     0.14   0.01
act      1803   129     9.63   0.69     2764   126    14.76   0.67
agc      5937   430    31.71   2.30     7855   419    41.95   2.24
agg     10203   773    54.49   4.13    21259   669   113.54   3.57
atc      8781   587    46.90   3.13    17212   539    91.92   2.88
ccg      2544   175    13.59   0.94     8418   166    44.96   0.89

CODING
           perfect                        imperfect
Repeat     TL    TN     LPMb   NPMb       TL    TN     LPMb   NPMb
aac        12     1     8.97   0.75       12     1     8.97   0.75
aag       165    13   123.39   9.72      207    13   154.80   9.72
aat         0     0     0.00   0.00        0     0     0.00   0.00
acc       180    13   134.61   9.72      275    11   205.65   8.23
acg        12     1     8.97   0.75       15     1    11.22   0.75
act         0     0     0.00   0.00        0     0     0.00   0.00
agc       612    38   457.67  28.42      961    35   718.65  26.17
agg       456    34   341.00  25.43      837    32   625.92  23.93
atc        72     6    53.84   4.49      165     6   123.39   4.49
ccg       447    31   334.27  23.18     1734    30  1296.72  22.43

INTRON
           perfect                        imperfect
Repeat     TL    TN     LPMb   NPMb       TL    TN     LPMb   NPMb
aac      8856   583   186.90  12.30    12746   577   269.00  12.18
aag      2517   154    53.12   3.25     5340   137   112.70   2.89
aat     14658   925   309.35  19.52    21224   903   447.93  19.06
acc      2367   168    49.95   3.55     5895   134   124.41   2.83
acg         0     0     0.00   0.00        0     0     0.00   0.00
act       594    41    12.54   0.86      811    40    17.12   0.84
agc      1413   100    29.82   2.11     1719    98    36.28   2.07
agg      2769   207    58.44   4.37     5947   182   125.51   3.84
atc      2322   159    49.01   3.36     6060   142   127.89   3.00
ccg       213    16     4.50   0.34      588    15    12.41   0.32

INTERGENIC
           perfect                        imperfect
Repeat     TL    TN     LPMb   NPMb       TL    TN     LPMb   NPMb
aac     22383  1507   161.59  10.88    33768  1479   243.78  10.68
aag      7767   505    56.07   3.65    17474   406   126.15   2.93
aat     39351  2595   284.08  18.73    58370  2543   421.38  18.36
acc      5484   403    39.59   2.91    11416   307    82.41   2.22
acg         0     0     0.00   0.00        0     0     0.00   0.00
act      1149    83     8.29   0.60     1863    81    13.45   0.58
agc      3645   272    26.31   1.96     4781   266    34.52   1.92
agg      5916   451    42.71   3.26    12518   382    90.37   2.76
atc      5133   358    37.06   2.58     8750   351    63.17   2.53
ccg      1359    94     9.81   0.68     4673    87    33.73   0.63

TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase
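
The per-megabase columns are the raw totals divided by the length of the sequence class in megabases. A minimal Python sketch of that normalization follows. The function name per_megabase and the constant ASSUMED_LENGTH_BP are hypothetical; the source does not state the sequence lengths it used, so the length below is back-calculated from the aac row of the ALL class (33318 / 177.94) and serves only as an illustration.

```python
def per_megabase(total, region_length_bp):
    """Scale a raw total (bases or repeat count) to a per-megabase value."""
    if region_length_bp <= 0:
        raise ValueError("region_length_bp must be positive")
    return total / (region_length_bp / 1_000_000)


# Assumption: length back-calculated from the table (33318 / 177.94 Mb);
# the source does not report the sequence lengths.
ASSUMED_LENGTH_BP = 187_240_000

lpmb = per_megabase(33318, ASSUMED_LENGTH_BP)  # TL of perfect aac repeats, ALL
npmb = per_megabase(2228, ASSUMED_LENGTH_BP)   # TN of perfect aac repeats, ALL
print(f"LPMb = {lpmb:.2f}, NPMb = {npmb:.2f}")  # LPMb = 177.94, NPMb = 11.90
```

Under the assumed length, both values agree with the perfect aac entries of the ALL class.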

Statistical analysis (contingency test results)

Data points with the smallest value were excluded iteratively until every remaining data point contributed more than 5% of the total. Data sets were normalized to 100%.

Feature                 Probability  ChiSquare  Degrees of freedom  Used / all data points  Excluded repeats
All sequences
  length per megabase         0.924      1.401                   5                  6 / 10  acg, act, agc, ccg
  number per megabase         0.998      0.259                   5                  6 / 10  acg, act, agc, ccg
Coding regions
  length per megabase         0.071      8.640                   4                  5 / 10  aac, aat, acg, act, atc
  number per megabase         0.999      0.089                   4                  5 / 10  aac, aat, acg, act, atc
Introns
  length per megabase         0.744      2.712                   5                  6 / 10  acg, act, agc, ccg
  number per megabase         0.999      0.200                   5                  6 / 10  acg, act, agc, ccg
Intergenic regions
  length per megabase         0.936      1.285                   5                  6 / 10  acg, act, agc, ccg
  number per megabase         0.997      0.345                   5                  6 / 10  acg, act, agc, ccg
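
The procedure described above (iterative exclusion, normalization to 100%, contingency test of perfect against imperfect repeats) can be sketched in Python as follows. This is one reading of the description, not the authors' code. It assumes that a repeat is dropped from both series as soon as its share in either series is 5% or less, and that the test is a Pearson chi-square on the 2 x k table of normalized values. The function names exclude_small and chi_square are hypothetical. The input values are the LPMb entries of the ALL class.

```python
# LPMb values for the ALL class, taken from the abundance table.
PERFECT = {"aac": 177.94, "aag": 59.68, "aat": 307.55, "acc": 45.47,
           "acg": 0.13, "act": 9.63, "agc": 31.71, "agg": 54.49,
           "atc": 46.90, "ccg": 13.59}
IMPERFECT = {"aac": 264.77, "aag": 130.99, "aat": 453.07, "acc": 97.54,
             "acg": 0.14, "act": 14.76, "agc": 41.95, "agg": 113.54,
             "atc": 91.92, "ccg": 44.96}


def exclude_small(perfect, imperfect, threshold=5.0):
    """Drop the repeat with the smallest share until all shares exceed threshold (%)."""
    kept = sorted(perfect)
    excluded = []
    while kept:
        shares = []
        for series in (perfect, imperfect):
            total = sum(series[k] for k in kept)
            shares += [(100.0 * series[k] / total, k) for k in kept]
        smallest_share, repeat = min(shares)
        if smallest_share > threshold:
            break
        # Assumption: a repeat failing in either series is removed from both.
        kept.remove(repeat)
        excluded.append(repeat)
    return kept, excluded


def chi_square(perfect, imperfect, kept):
    """Pearson chi-square on the 2 x k table of values normalized to 100%."""
    rows = []
    for series in (perfect, imperfect):
        total = sum(series[k] for k in kept)
        rows.append([100.0 * series[k] / total for k in kept])
    col_totals = [sum(col) for col in zip(*rows)]
    grand = sum(col_totals)
    stat = 0.0
    for row in rows:
        row_total = sum(row)
        for observed, col_total in zip(row, col_totals):
            expected = row_total * col_total / grand
            stat += (observed - expected) ** 2 / expected
    dof = (len(rows) - 1) * (len(kept) - 1)
    return stat, dof


kept, excluded = exclude_small(PERFECT, IMPERFECT)
stat, dof = chi_square(PERFECT, IMPERFECT, kept)
print(sorted(excluded), round(stat, 3), dof)
```

For these inputs the sketch excludes acg, act, agc and ccg and gives a statistic of about 1.40 with 5 degrees of freedom, in line with the "All sequences, length per megabase" row. The probability column would then follow from the upper tail of the chi-square distribution, for example with scipy.stats.chi2.sf(stat, dof) if SciPy is available.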