Natural Order LMs
Collection
All the models trained in the paper 'Natural Order: Cross-lingual Limits of Transformer Language Acquisition'
•
35 items
•
Updated
This model is a fine-tuned version of on an unknown dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 76.0554 | 0.9714 | 17 | 9.4159 |
| 71.5116 | 1.9714 | 34 | 8.9223 |
| 68.1104 | 2.9714 | 51 | 8.4983 |
| 64.8415 | 3.9714 | 68 | 8.0835 |
| 61.5923 | 4.9714 | 85 | 7.6397 |
| 57.4736 | 5.9714 | 102 | 7.1809 |
| 53.9559 | 6.9714 | 119 | 6.7377 |
| 51.5502 | 7.9714 | 136 | 6.4381 |
| 50.2245 | 8.9714 | 153 | 6.2864 |
| 49.7075 | 9.9714 | 170 | 6.1920 |
| 48.9016 | 10.9714 | 187 | 6.1134 |
| 48.5056 | 11.9714 | 204 | 6.0484 |
| 47.7919 | 12.9714 | 221 | 5.9937 |
| 47.6727 | 13.9714 | 238 | 5.9478 |
| 47.0178 | 14.9714 | 255 | 5.9050 |
| 46.7872 | 15.9714 | 272 | 5.8797 |
| 46.6167 | 16.9714 | 289 | 5.8400 |
| 46.0477 | 17.9714 | 306 | 5.8148 |
| 45.8791 | 18.9714 | 323 | 5.7810 |
| 45.4665 | 19.9714 | 340 | 5.7590 |
| 45.567 | 20.9714 | 357 | 5.7255 |
| 45.1798 | 21.9714 | 374 | 5.6946 |
| 44.9232 | 22.9714 | 391 | 5.6630 |
| 44.5422 | 23.9714 | 408 | 5.6316 |
| 44.3638 | 24.9714 | 425 | 5.6000 |
| 43.7262 | 25.9714 | 442 | 5.5667 |
| 43.5779 | 26.9714 | 459 | 5.5362 |
| 43.2879 | 27.9714 | 476 | 5.5000 |
| 42.837 | 28.9714 | 493 | 5.4680 |
| 42.8458 | 29.9714 | 510 | 5.4374 |
| 42.5224 | 30.9714 | 527 | 5.4100 |
| 42.426 | 31.9714 | 544 | 5.3854 |
| 42.0329 | 32.9714 | 561 | 5.3660 |
| 41.6448 | 33.9714 | 578 | 5.3379 |
| 41.5732 | 34.9714 | 595 | 5.3176 |
| 41.415 | 35.9714 | 612 | 5.2978 |
| 41.18 | 36.9714 | 629 | 5.2752 |
| 40.7777 | 37.9714 | 646 | 5.2583 |
| 40.6049 | 38.9714 | 663 | 5.2439 |
| 40.3485 | 39.9714 | 680 | 5.2284 |
| 40.0764 | 40.9714 | 697 | 5.2123 |
| 40.0139 | 41.9714 | 714 | 5.2003 |
| 39.7959 | 42.9714 | 731 | 5.1847 |
| 39.5697 | 43.9714 | 748 | 5.1716 |
| 39.4945 | 44.9714 | 765 | 5.1659 |
| 39.1866 | 45.9714 | 782 | 5.1527 |
| 39.2213 | 46.9714 | 799 | 5.1453 |
| 38.8962 | 47.9714 | 816 | 5.1360 |
| 38.6798 | 48.9714 | 833 | 5.1261 |
| 38.3602 | 49.9714 | 850 | 5.1270 |
| 38.2267 | 50.9714 | 867 | 5.1193 |
| 37.9815 | 51.9714 | 884 | 5.1154 |
| 37.8963 | 52.9714 | 901 | 5.1140 |
| 37.6935 | 53.9714 | 918 | 5.1109 |
| 37.7724 | 54.9714 | 935 | 5.1098 |
| 37.0604 | 55.9714 | 952 | 5.1090 |
| 37.3595 | 56.9714 | 969 | 5.1066 |
| 37.1589 | 57.9714 | 986 | 5.1066 |
| 36.6485 | 58.9714 | 1003 | 5.1044 |
| 36.7598 | 59.9714 | 1020 | 5.1052 |
| 36.5849 | 60.9714 | 1037 | 5.1088 |
| 36.1596 | 61.9714 | 1054 | 5.1110 |
| 36.3068 | 62.9714 | 1071 | 5.1163 |
| 36.3486 | 63.9714 | 1088 | 5.1161 |
| 35.9179 | 64.9714 | 1105 | 5.1214 |
| 35.6792 | 65.9714 | 1122 | 5.1266 |
| 35.4873 | 66.9714 | 1139 | 5.1295 |
| 35.3164 | 67.9714 | 1156 | 5.1387 |
| 35.0348 | 68.9714 | 1173 | 5.1421 |
| 35.1965 | 69.9714 | 1190 | 5.1503 |
| 35.086 | 70.9714 | 1207 | 5.1529 |
| 34.6337 | 71.9714 | 1224 | 5.1637 |
| 34.7382 | 72.9714 | 1241 | 5.1662 |
| 34.4871 | 73.9714 | 1258 | 5.1757 |
| 34.2342 | 74.9714 | 1275 | 5.1806 |
| 34.1668 | 75.9714 | 1292 | 5.1919 |
| 34.0995 | 76.9714 | 1309 | 5.1998 |
| 33.8965 | 77.9714 | 1326 | 5.2114 |
| 33.9098 | 78.9714 | 1343 | 5.2200 |
| 33.63 | 79.9714 | 1360 | 5.2305 |
| 33.4706 | 80.9714 | 1377 | 5.2291 |
| 33.505 | 81.9714 | 1394 | 5.2448 |
| 33.3618 | 82.9714 | 1411 | 5.2457 |
| 33.132 | 83.9714 | 1428 | 5.2597 |
| 33.0071 | 84.9714 | 1445 | 5.2663 |
| 32.8751 | 85.9714 | 1462 | 5.2750 |
| 32.8287 | 86.9714 | 1479 | 5.2870 |
| 32.4965 | 87.9714 | 1496 | 5.2981 |
| 32.6413 | 88.9714 | 1513 | 5.3074 |
| 32.5603 | 89.9714 | 1530 | 5.3142 |
| 32.2966 | 90.9714 | 1547 | 5.3253 |
| 32.1185 | 91.9714 | 1564 | 5.3355 |
| 32.0684 | 92.9714 | 1581 | 5.3424 |
| 32.202 | 93.9714 | 1598 | 5.3535 |
| 31.5632 | 94.9714 | 1615 | 5.3645 |
| 31.565 | 95.9714 | 1632 | 5.3747 |
| 31.4838 | 96.9714 | 1649 | 5.3822 |
| 31.4564 | 97.9714 | 1666 | 5.3918 |
| 31.3305 | 98.9714 | 1683 | 5.3993 |
| 31.3431 | 99.9714 | 1700 | 5.4117 |
| 31.1942 | 100.9714 | 1717 | 5.4160 |
| 30.9246 | 101.9714 | 1734 | 5.4310 |
| 30.8694 | 102.9714 | 1751 | 5.4373 |
| 30.8388 | 103.9714 | 1768 | 5.4432 |
| 30.6456 | 104.9714 | 1785 | 5.4533 |
| 30.5814 | 105.9714 | 1802 | 5.4659 |
| 30.5805 | 106.9714 | 1819 | 5.4699 |
| 30.5545 | 107.9714 | 1836 | 5.4812 |
| 30.4305 | 108.9714 | 1853 | 5.4908 |
| 30.157 | 109.9714 | 1870 | 5.4988 |
| 29.9876 | 110.9714 | 1887 | 5.5073 |
| 30.1266 | 111.9714 | 1904 | 5.5117 |
| 29.8895 | 112.9714 | 1921 | 5.5229 |
| 29.7649 | 113.9714 | 1938 | 5.5295 |
| 29.8926 | 114.9714 | 1955 | 5.5350 |
| 29.6378 | 115.9714 | 1972 | 5.5491 |
| 29.6415 | 116.9714 | 1989 | 5.5559 |
| 29.7529 | 117.9714 | 2006 | 5.5609 |
| 29.3384 | 118.9714 | 2023 | 5.5695 |
| 29.359 | 119.9714 | 2040 | 5.5755 |
| 29.3304 | 120.9714 | 2057 | 5.5825 |
| 29.2433 | 121.9714 | 2074 | 5.5912 |
| 29.0092 | 122.9714 | 2091 | 5.5983 |
| 29.3211 | 123.9714 | 2108 | 5.6037 |
| 29.0934 | 124.9714 | 2125 | 5.6071 |
| 28.9074 | 125.9714 | 2142 | 5.6124 |
| 28.8782 | 126.9714 | 2159 | 5.6202 |
| 28.9611 | 127.9714 | 2176 | 5.6302 |
| 28.8142 | 128.9714 | 2193 | 5.6343 |
| 28.6845 | 129.9714 | 2210 | 5.6432 |
| 28.738 | 130.9714 | 2227 | 5.6447 |
| 28.7464 | 131.9714 | 2244 | 5.6530 |
| 28.6346 | 132.9714 | 2261 | 5.6572 |
| 28.4787 | 133.9714 | 2278 | 5.6630 |
| 28.504 | 134.9714 | 2295 | 5.6695 |
| 28.3428 | 135.9714 | 2312 | 5.6756 |
| 28.3463 | 136.9714 | 2329 | 5.6787 |
| 28.496 | 137.9714 | 2346 | 5.6803 |
| 28.3236 | 138.9714 | 2363 | 5.6871 |
| 28.2455 | 139.9714 | 2380 | 5.6904 |
| 28.2482 | 140.9714 | 2397 | 5.6949 |
| 28.1263 | 141.9714 | 2414 | 5.6998 |
| 28.1918 | 142.9714 | 2431 | 5.7042 |
| 28.1143 | 143.9714 | 2448 | 5.7056 |
| 28.0906 | 144.9714 | 2465 | 5.7094 |
| 28.0002 | 145.9714 | 2482 | 5.7113 |
| 27.8926 | 146.9714 | 2499 | 5.7156 |
| 28.1241 | 147.9714 | 2516 | 5.7199 |
| 28.0328 | 148.9714 | 2533 | 5.7210 |
| 27.9288 | 149.9714 | 2550 | 5.7237 |
| 27.9317 | 150.9714 | 2567 | 5.7265 |
| 27.7598 | 151.9714 | 2584 | 5.7274 |
| 27.9603 | 152.9714 | 2601 | 5.7322 |
| 27.7704 | 153.9714 | 2618 | 5.7333 |
| 27.7615 | 154.9714 | 2635 | 5.7345 |
| 27.7547 | 155.9714 | 2652 | 5.7361 |
| 27.7851 | 156.9714 | 2669 | 5.7389 |
| 27.726 | 157.9714 | 2686 | 5.7394 |
| 27.7772 | 158.9714 | 2703 | 5.7407 |
| 27.8406 | 159.9714 | 2720 | 5.7429 |
| 27.7328 | 160.9714 | 2737 | 5.7429 |
| 27.6921 | 161.9714 | 2754 | 5.7444 |
| 27.6875 | 162.9714 | 2771 | 5.7458 |
| 27.8319 | 163.9714 | 2788 | 5.7465 |
| 27.7842 | 164.9714 | 2805 | 5.7467 |
| 27.6917 | 165.9714 | 2822 | 5.7481 |
| 27.6102 | 166.9714 | 2839 | 5.7489 |
| 27.8374 | 167.9714 | 2856 | 5.7481 |
| 27.6557 | 168.9714 | 2873 | 5.7492 |
| 27.5827 | 169.9714 | 2890 | 5.7491 |
| 27.7106 | 170.9714 | 2907 | 5.7492 |
| 27.6843 | 171.9714 | 2924 | 5.7496 |
| 27.6455 | 172.9714 | 2941 | 5.7494 |
| 27.6835 | 173.9714 | 2958 | 5.7496 |
| 27.6126 | 174.9714 | 2975 | 5.7497 |
| 27.6715 | 175.9714 | 2992 | 5.7497 |
| 27.734 | 176.4571 | 3000 | 5.7497 |