Abundance of perfect and imperfect trinucleotide repeats in chimpanzee chromosome 5

ALL
        perfect                         imperfect
Repeat      TL      TN    LPMb    NPMb      TL      TN    LPMb    NPMb
aac      34371    2217  179.90   11.60   51205    2176  268.00   11.39
aag       9132     618   47.80    3.23   18809     546   98.44    2.86
aat      49695    3277  260.10   17.15   74410    3193  389.46   16.71
acc       6795     501   35.56    2.62   10411     493   54.49    2.58
acg         36       3    0.19    0.02      81       3    0.42    0.02
act       1947     128   10.19    0.67    3434     118   17.97    0.62
agc       6927     510   36.26    2.67    9006     503   47.14    2.63
agg       9672     738   50.62    3.86   19502     676  102.07    3.54
atc       8817     592   46.15    3.10   14295     557   74.82    2.92
ccg       2049     138   10.72    0.72    5868     127   30.71    0.67

CODING
        perfect                         imperfect
Repeat      TL      TN    LPMb    NPMb      TL      TN    LPMb    NPMb
aac         39       3   25.87    1.99      57       3   37.81    1.99
aag        195      16  129.35   10.61     259      16  171.80   10.61
aat         12       1    7.96    0.66      15       1    9.95    0.66
acc        216      15  143.28    9.95     321      14  212.92    9.29
acg         24       2   15.92    1.33      66       2   43.78    1.33
act          0       0    0.00    0.00       0       0    0.00    0.00
agc        564      43  374.11   28.52    1014      40  672.60   26.53
agg        249      19  165.17   12.60     469      18  311.10   11.94
atc        159      12  105.47    7.96     291      10  193.03    6.63
ccg        426      31  282.57   20.56     956      28  634.13   18.57

INTRON
        perfect                         imperfect
Repeat      TL      TN    LPMb    NPMb      TL      TN    LPMb    NPMb
aac       9057     602  175.92   11.69   13321     591  258.75   11.48
aag       2118     144   41.14    2.80    3696     132   71.79    2.56
aat      13488     882  261.99   17.13   19870     853  385.95   16.57
acc       2049     151   39.80    2.93    2792     150   54.23    2.91
acg          0       0    0.00    0.00       0       0    0.00    0.00
act        558      35   10.84    0.68     752      34   14.61    0.66
agc       1590     118   30.88    2.29    2061     118   40.03    2.29
agg       2481     191   48.19    3.71    5192     180  100.85    3.50
atc       2811     182   54.60    3.54    4643     175   90.19    3.40
ccg        204      13    3.96    0.25     721      12   14.01    0.23

INTERGENIC
        perfect                         imperfect
Repeat      TL      TN    LPMb    NPMb      TL      TN    LPMb    NPMb
aac      23535    1505  170.46   10.90   35240    1478  255.23   10.71
aag       6438     433   46.63    3.14   14008     374  101.46    2.71
aat      34476    2272  249.70   16.45   51876    2219  375.72   16.07
acc       4038     300   29.25    2.17    6569     297   47.58    2.15
acg         12       1    0.09    0.01      15       1    0.11    0.01
act       1335      90    9.67    0.65    2595      81   18.80    0.59
agc       4368     318   31.64    2.30    5451     315   39.48    2.28
agg       6387     485   46.26    3.51   12823     436   92.87    3.16
atc       5502     375   39.85    2.72    8783     349   63.61    2.53
ccg       1086      73    7.87    0.53    3316      68   24.02    0.49

TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase
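
The per-megabase columns are the raw totals scaled by the size of the sequence class in megabases. The class sizes are not listed in the table, so the minimal Python sketch below infers one from a single row (an assumption, not a value reported by the source) and checks that NPMb follows from TN. The names `size_mb` and `per_megabase` are hypothetical.

```python
# Values from the "aac" row, ALL / perfect columns of the table.
tl, tn, lpmb, npmb = 34371, 2217, 179.90, 11.60

# Assumption: LPMb = TL / size_mb, so the class size can be inferred
# from TL and LPMb. The source does not report this size directly.
size_mb = tl / lpmb


def per_megabase(total, size_mb):
    """Scale a raw total (length or count) to a per-megabase value."""
    return total / size_mb


npmb_check = per_megabase(tn, size_mb)  # should agree with the NPMb column
mean_repeat_length = tl / tn            # average length of one repeat, in TL units

print(round(size_mb, 1), round(npmb_check, 2), round(mean_repeat_length, 1))
```

The same ratio TL / TN equals LPMb / NPMb, which gives a quick consistency check for any cell of the table.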

Statistical analysis (contingency test results)

Data points with the smallest value were excluded iteratively until every remaining data point contributed more than 5% of the total. Data sets were then normalized to 100%.
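
The exclusion and normalization step can be sketched in Python as follows. This is one possible reading of the rule, not the authors' code: it assumes a repeat class is dropped from the perfect and imperfect series together, and that the class dropped at each step is the one with the smallest percentage share in either series. The function name `exclude_and_normalize` is hypothetical; the input values are the LPMb figures from the ALL columns of the abundance table.

```python
def exclude_and_normalize(perfect, imperfect, threshold=5.0):
    """Drop low-share repeat classes iteratively, then rescale to 100%.

    perfect / imperfect: dicts mapping repeat class -> value (e.g. LPMb).
    Assumption: a class is removed from both series at once, and the class
    removed at each step has the smallest share (in %) in either series.
    """
    series = (perfect, imperfect)
    kept = sorted(perfect)
    excluded = []
    while True:
        totals = [sum(s[k] for k in kept) for s in series]
        # Smallest percentage share of each class across the two series.
        share = {
            k: min(100.0 * s[k] / t for s, t in zip(series, totals))
            for k in kept
        }
        worst = min(kept, key=share.get)
        if share[worst] > threshold:
            break  # every remaining class contributes more than the threshold
        kept.remove(worst)
        excluded.append(worst)
    normalized = [
        {k: 100.0 * s[k] / t for k in kept} for s, t in zip(series, totals)
    ]
    return kept, sorted(excluded), normalized


# LPMb values, ALL sequences (perfect and imperfect columns).
perfect = {"aac": 179.90, "aag": 47.80, "aat": 260.10, "acc": 35.56,
           "acg": 0.19, "act": 10.19, "agc": 36.26, "agg": 50.62,
           "atc": 46.15, "ccg": 10.72}
imperfect = {"aac": 268.00, "aag": 98.44, "aat": 389.46, "acc": 54.49,
             "acg": 0.42, "act": 17.97, "agc": 47.14, "agg": 102.07,
             "atc": 74.82, "ccg": 30.71}

kept, excluded, normalized = exclude_and_normalize(perfect, imperfect)
print(len(kept), "of", len(perfect), "classes kept; excluded:", ", ".join(excluded))
```

Under these assumptions the sketch retains 6 of 10 classes and excludes acg, act, agc and ccg for this data set.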

Feature                Probability  Chi-square  Degrees of freedom  Used / all data points  Excluded repeats
All sequences
  length per megabase  0.984        0.689       5                   6 / 10                  acg, act, agc, ccg
  number per megabase  1.000        0.049       6                   7 / 10                  acg, act, ccg
Coding regions
  length per megabase  0.940        1.255       5                   6 / 10                  aac, aat, acg, act
  number per megabase  1.000        0.094       5                   6 / 10                  aac, aat, acg, act
Introns
  length per megabase  0.987        0.613       5                   6 / 10                  acg, act, agc, ccg
  number per megabase  1.000        0.020       6                   7 / 10                  acg, act, ccg
Intergenic regions
  length per megabase  0.932        0.849       4                   5 / 10                  acc, acg, act, agc, ccg
  number per megabase  1.000        0.079       6                   7 / 10                  acg, act, ccg
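
The source does not state which software or test variant produced these figures. As an illustration only, the Python sketch below assumes a Pearson chi-square test on a 2 x k contingency table (perfect versus imperfect repeats) built from the values normalized to 100%, using the six repeat classes retained for all sequences, length per megabase. The helper names `chi_square_contingency` and `chi_square_sf` are hypothetical, and the probability is computed with a standard series for the incomplete gamma function rather than a statistics library.

```python
import math


def chi_square_contingency(row_a, row_b):
    """Pearson chi-square statistic and degrees of freedom for a 2 x k table."""
    total_a, total_b = sum(row_a), sum(row_b)
    grand = total_a + total_b
    stat = 0.0
    for a, b in zip(row_a, row_b):
        col = a + b
        for observed, row_total in ((a, total_a), (b, total_b)):
            expected = row_total * col / grand
            stat += (observed - expected) ** 2 / expected
    return stat, len(row_a) - 1  # (rows - 1) * (columns - 1) with 2 rows


def chi_square_sf(stat, df, terms=200):
    """Upper-tail probability of the chi-square distribution.

    Uses the power series for the regularized lower incomplete gamma
    function, which is adequate for small statistics such as these.
    """
    a, x = df / 2.0, stat / 2.0
    if x <= 0:
        return 1.0
    term = total = 1.0 / a
    for n in range(1, terms):
        term *= x / (a + n)
        total += term
    lower = total * math.exp(-x + a * math.log(x) - math.lgamma(a))
    return 1.0 - lower


# LPMb for aac, aag, aat, acc, agg, atc (all sequences), from the abundance table.
perfect = [179.90, 47.80, 260.10, 35.56, 50.62, 46.15]
imperfect = [268.00, 98.44, 389.46, 54.49, 102.07, 74.82]

# Normalize each series to 100% before testing, as described in the text.
perfect_pct = [100.0 * v / sum(perfect) for v in perfect]
imperfect_pct = [100.0 * v / sum(imperfect) for v in imperfect]

stat, df = chi_square_contingency(perfect_pct, imperfect_pct)
p_value = chi_square_sf(stat, df)
print(f"chi-square = {stat:.3f}, df = {df}, p = {p_value:.3f}")
```

Under these assumptions the result should land close to the tabulated values for that row (chi-square 0.689, 5 degrees of freedom, probability 0.984), which suggests the assumed test is at least consistent with the reported one.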