Abundance of perfect and imperfect trinucleotide repeats in human chromosome 20

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 10674 695179.6211.70 16307 678274.4111.41 0 00.000.00 0 00.000.00 3054 199163.5410.66 4741 197253.8810.55 6900 447172.8111.20 10523 433263.5510.85
aag 4761 29580.124.96 10193 216171.533.63 144 11175.0513.37 223 11271.0813.37 975 6552.213.48 2074 58111.063.11 3480 20987.165.23 7521 137188.363.43
aat 17916 1128301.4918.98 26057 1105438.4918.59 0 00.000.00 0 00.000.00 5400 341289.1718.26 7674 336410.9417.99 11049 696276.7217.43 16322 679408.7917.01
acc 4047 29168.104.90 11175 236188.053.97 63 576.586.08 144 5175.056.08 1347 9672.135.14 2851 83152.674.45 2325 17058.234.26 7732 129193.653.23
acg 39 30.660.05 51 30.860.05 12 114.591.22 12 114.591.22 0 00.000.00 0 00.000.00 12 10.300.03 12 10.300.03
act 570 389.590.64 1059 3617.820.61 0 00.000.00 0 00.000.00 207 1411.090.75 315 1316.870.70 315 207.890.50 684 1917.130.48
agc 3699 27462.254.61 4956 26683.404.48 648 46787.7155.92 1229 421493.9851.05 1020 7554.624.02 1201 7464.313.96 1776 13444.483.36 2176 13154.503.28
agg 5907 42799.407.19 12773 359214.946.04 420 29510.5535.25 1063 231292.1927.96 1578 12084.506.43 2941 115157.496.16 3474 25187.016.29 7997 197200.294.93
atc 3477 23458.513.94 15230 210256.293.53 48 458.354.86 120 4145.874.86 1068 6457.193.43 2005 63107.373.37 2184 15254.703.81 12768 129319.783.23
ccg 1659 11127.921.87 5397 9390.821.56 339 24412.0929.18 1180 211434.4125.53 219 1511.730.80 678 1236.310.64 663 4216.611.05 2199 3555.070.88
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.3906.30367 / 10 acg, act, ccg
number per megabase0.9990.43967 / 10 acg, act, ccg
Coding regions
length per megabase0.3093.59534 / 10 aac, aat, acc, acg, act, atc
number per megabase0.9690.25134 / 10 aac, aat, acc, acg, act, atc
Introns
length per megabase0.9601.48867 / 10 acg, act, ccg
number per megabase1.0000.09567 / 10 acg, act, ccg
Intergenic regions
length per megabase0.06910.21956 / 10 acg, act, agc, ccg
number per megabase0.9900.86167 / 10 acg, act, ccg