Abundance of perfect and imperfect trinucleotide repeats in chimpanzee chromosome 4

ALL
                  perfect                          imperfect
Repeat      TL      TN    LPMb    NPMb        TL      TN    LPMb    NPMb
aac      35334    2329  176.37   11.62     52453    2285  261.81   11.40
aag      10518     715   52.50    3.57     21675     623  108.19    3.11
aat      55278    3686  275.92   18.40     82762    3608  413.10   18.01
acc       6540     487   32.64    2.43      9846     467   49.15    2.33
acg         36       3    0.18    0.01        63       3    0.31    0.01
act       2121     149   10.59    0.74      3104     143   15.49    0.71
agc       5781     407   28.86    2.03      7748     401   38.67    2.00
agg       9354     717   46.69    3.58     19304     662   96.35    3.30
atc       8166     557   40.76    2.78     15046     545   75.10    2.72
ccg       2031     134   10.14    0.67      6111     121   30.50    0.60

CODING
                  perfect                          imperfect
Repeat      TL      TN    LPMb    NPMb        TL      TN    LPMb    NPMb
aac          0       0    0.00    0.00         0       0    0.00    0.00
aag        129      10   97.99    7.60       183      10  139.01    7.60
aat         12       1    9.12    0.76        18       1   13.67    0.76
acc        168      13  127.62    9.88       290      12  220.29    9.12
acg          0       0    0.00    0.00         0       0    0.00    0.00
act          0       0    0.00    0.00         0       0    0.00    0.00
agc        384      27  291.69   20.51       664      26  504.39   19.75
agg        453      35  344.11   26.59       959      32  728.48   24.31
atc         72       6   54.69    4.56       165       6  125.34    4.56
ccg        252      18  191.43   13.67       683      18  518.82   13.67

INTRON
                  perfect                          imperfect
Repeat      TL      TN    LPMb    NPMb        TL      TN    LPMb    NPMb
aac       9084     589  184.13   11.94     13234     580  268.25   11.76
aag       2298     157   46.58    3.18      4659     143   94.44    2.90
aat      13899     907  281.73   18.39     20634     884  418.24   17.92
acc       1830     136   37.09    2.76      3026     123   61.34    2.49
acg         24       2    0.49    0.04        51       2    1.03    0.04
act        561      39   11.37    0.79       739      37   14.98    0.75
agc       1377      97   27.91    1.97      1818      95   36.85    1.93
agg       2475     191   50.17    3.87      5014     178  101.63    3.61
atc       2286     153   46.34    3.10      3460     149   70.13    3.02
ccg        558      35   11.31    0.71      1957      30   39.67    0.61

INTERGENIC
                  perfect                          imperfect
Repeat      TL      TN    LPMb    NPMb        TL      TN    LPMb    NPMb
aac      24804    1653  165.70   11.04     37201    1618  248.52   10.81
aag       7770     524   51.91    3.50     16159     448  107.95    2.99
aat      39240    2631  262.14   17.58     58902    2577  393.49   17.21
acc       4329     321   28.92    2.14      6219     315   41.55    2.10
acg         12       1    0.08    0.01        12       1    0.08    0.01
act       1506     106   10.06    0.71      2281     102   15.24    0.68
agc       3786     267   25.29    1.78      4968     264   33.19    1.76
agg       5790     444   38.68    2.97     11883     409   79.38    2.73
atc       5439     372   36.34    2.48     10909     364   72.88    2.43
ccg       1095      71    7.32    0.47      3099      63   20.70    0.42

TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase
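
As an illustration of the per-megabase columns, the following Python sketch assumes that LPMb is TL divided by the size of the region in megabases, and NPMb is TN divided by the same size. The region sizes are not reported in this table, so the size used here is only inferred from the aac perfect-repeat entry for all sequences; the function names are hypothetical.

```python
def per_megabase(total, region_bp):
    """Scale a total (length or count) to a per-megabase density."""
    return total / (region_bp / 1_000_000)


def implied_region_mb(total, per_mb):
    """Back-calculate the region size in Mb implied by a table entry."""
    return total / per_mb


# aac, all sequences, perfect repeats: TL = 35334, LPMb = 176.37 (from the table).
# The region size is NOT given in the source; it is inferred here.
size_mb = implied_region_mb(35334, 176.37)
lpmb = per_megabase(35334, size_mb * 1_000_000)

print(f"implied region size: {size_mb:.2f} Mb")
print(f"recomputed LPMb: {lpmb:.2f}")
```

Because the tabulated densities are rounded to two decimals, sizes back-calculated from different rows agree only approximately.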

Statistical analysis (contingency test results)

Data points with the smallest value were excluded iteratively until every remaining data point contributed more than 5% of the total. Data sets were then normalized to 100%.
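
The exclusion and normalization steps can be sketched in Python as follows. This is one possible reading of the procedure, not the authors' code: it assumes that a repeat class is judged by its smaller percentage share across the perfect and imperfect series, and that the class with the smallest such share is dropped at each step. The function names are hypothetical; the input values are the all-sequences LPMb figures from the abundance table.

```python
def exclude_small_classes(perfect, imperfect, threshold=5.0):
    """Drop classes one at a time until every remaining class contributes
    more than `threshold` percent of the total in both series.
    Assumes both series keep a positive total."""
    kept = sorted(perfect)
    while kept:
        total_p = sum(perfect[k] for k in kept)
        total_i = sum(imperfect[k] for k in kept)
        # Assumption: a class is judged by its smaller share across both series.
        share = {
            k: min(100 * perfect[k] / total_p, 100 * imperfect[k] / total_i)
            for k in kept
        }
        smallest = min(kept, key=share.get)
        if share[smallest] > threshold:
            break
        kept.remove(smallest)
    return kept


def normalize_to_percent(values, kept):
    """Rescale the kept classes so that they sum to 100."""
    total = sum(values[k] for k in kept)
    return {k: 100 * values[k] / total for k in kept}


# All sequences, length per megabase (LPMb), taken from the abundance table.
perfect = {"aac": 176.37, "aag": 52.50, "aat": 275.92, "acc": 32.64,
           "acg": 0.18, "act": 10.59, "agc": 28.86, "agg": 46.69,
           "atc": 40.76, "ccg": 10.14}
imperfect = {"aac": 261.81, "aag": 108.19, "aat": 413.10, "acc": 49.15,
             "acg": 0.31, "act": 15.49, "agc": 38.67, "agg": 96.35,
             "atc": 75.10, "ccg": 30.50}

kept = exclude_small_classes(perfect, imperfect)
perfect_pct = normalize_to_percent(perfect, kept)
imperfect_pct = normalize_to_percent(imperfect, kept)
print(kept)
```

Under these assumptions the sketch keeps aac, aag, aat, agg and atc, which agrees with the five excluded repeats listed for all sequences, length per megabase.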

Feature                 Probability  ChiSquare  Degrees of freedom  Used / all data points  Excluded repeats
All sequences
  length per megabase   0.930        0.858      4                   5 / 10                  acc, acg, act, agc, ccg
  number per megabase   1.000        0.059      5                   6 / 10                  acg, act, agc, ccg
Coding regions
  length per megabase   0.758        1.880      4                   5 / 10                  aac, aat, acg, act, atc
  number per megabase   1.000        0.071      5                   6 / 10                  aac, aat, acg, act
Introns
  length per megabase   0.982        0.709      5                   6 / 10                  acg, act, agc, ccg
  number per megabase   1.000        0.038      5                   6 / 10                  acg, act, agc, ccg
Intergenic regions
  length per megabase   0.920        0.930      4                   5 / 10                  acc, acg, act, agc, ccg
  number per megabase   1.000        0.078      5                   6 / 10                  acg, act, agc, ccg
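
As a rough check on the first row of the results (all sequences, length per megabase), the Python sketch below computes a Pearson chi-square statistic on a 2 x 5 table of perfect versus imperfect LPMb values for the five retained repeat classes, after normalizing each series to 100%. That the contingency test was set up this way is an assumption; the source does not describe the software or the exact table layout. The function names are hypothetical, and the closed-form probability applies only to an even number of degrees of freedom.

```python
import math


def chi_square_2xk(row_a, row_b):
    """Pearson chi-square statistic and degrees of freedom for a 2 x k table."""
    total_a, total_b = sum(row_a), sum(row_b)
    grand = total_a + total_b
    stat = 0.0
    for a, b in zip(row_a, row_b):
        column = a + b
        for observed, row_total in ((a, total_a), (b, total_b)):
            expected = row_total * column / grand
            stat += (observed - expected) ** 2 / expected
    return stat, len(row_a) - 1


def chi_square_sf_even_df(stat, df):
    """Upper-tail chi-square probability; closed form for even df only."""
    if df % 2:
        raise ValueError("closed form requires an even number of degrees of freedom")
    half = stat / 2
    return math.exp(-half) * sum(half ** k / math.factorial(k) for k in range(df // 2))


def to_percent(values):
    """Normalize a series so that it sums to 100."""
    total = sum(values)
    return [100 * v / total for v in values]


# LPMb for aac, aag, aat, agg, atc (all sequences), from the abundance table.
perfect = to_percent([176.37, 52.50, 275.92, 46.69, 40.76])
imperfect = to_percent([261.81, 108.19, 413.10, 96.35, 75.10])

stat, df = chi_square_2xk(perfect, imperfect)
prob = chi_square_sf_even_df(stat, df)
print(f"chi-square = {stat:.3f}, df = {df}, probability = {prob:.3f}")
```

Under these assumptions the statistic and probability come out close to the tabulated 0.858 and 0.930 with 4 degrees of freedom. Rows with 5 degrees of freedom would need a general chi-square distribution function, for example `scipy.stats.chi2.sf`.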