I was curious to see how many and how regularly common names were being added to iNat, here are the results for anyone else interested :)
Common names Jan-Feb 2023
Lexicons and name counts from the Darwin Core archives exported on the first of each month.
Top Lexicons
Lexicons with 1000 or more names
#
Lexicon
Names
|
#
Lexicon
Names
1
English
264269
|
28
Zulu
3444
2
Chinese (Simplified)
103690
|
29
Catalan
3322
3
Czech
79041
|
30
Mandarin Chinese
3119
4
Japanese
67843
|
31
Turkish
3045
5
Spanish
60465
|
32
Indonesian
3018
6
Russian
58783
|
33
Setswana
2705
7
Chinese (Traditional)
55805
|
34
Croatian
2378
8
French
47528
|
35
Slovenian
2353
9
German
44454
|
36
Vermont Flora Codes
2273
10
Dutch
37134
|
37
Bulgarian
1955
11
Finnish
36257
|
38
Slovak
1612
12
Portuguese
31243
|
39
Latvian
1540
13
Swedish
28584
|
40
Nahuatl
1511
14
Norwegian
27025
|
41
Armenian
1430
15
Danish
20854
|
42
Xhosa
1401
16
Afrikaans
19031
|
43
Maori
1368
17
Korean
18575
|
44
Tagalog
1354
18
Arabic
15212
|
45
Malay (Individual Language)
1307
19
Thai
13501
|
46
Belarusian
1290
20
Polish
13316
|
47
Malayalam
1254
21
Italian
13129
|
48
Aou 4 Letter Codes
1230
22
Lithuanian
12880
|
49
Slovene
1230
23
Estonian
12143
|
50
AOU 4-Letter Codes
1143
24
Ukrainian
9400
|
51
Ojibwe
1013
25
Hebrew
7215
|
52
Hodges Number
1012
26
Hungarian
7156
|
53
Visayan
1006
27
Greek
4747
|
54
Hawaiian
1003
Slovene is a âbadâ lexicon these names should be in the Slovenian lexicon. While the common name would be searchable it wonât show at the top of the species/taxon page. Adding the two together, ignoring any duplicates, puts Slovenian in 28th position.
Active Lexicons
Lexicons with 30 or more new names
#
Lexicon
Change
|
#
Lexicon
Change
1
Chinese (Simplified)
3472
|
14
Dutch
289
2
Portuguese
1985
|
15
Arabic
252
3
French
1751
|
16
Thai
248
4
English
1381
|
17
Italian
191
5
German
1075
|
18
Scientific Name
115
6
Russian
781
|
19
Korean
110
7
Danish
587
|
20
Hebrew
78
8
Japanese
562
|
21
Swedish
53
9
Lithuanian
530
|
22
Kannada
49
10
Chinese (Traditional)
490
|
23
Ukrainian
47
11
Hungarian
451
|
24
Quechua
46
12
Polish
440
|
25
Slovak
37
13
Spanish
290
|
Another âbadâ lexicon âScientific Nameâ should be âScientific Namesâ.
Iâm pleased to see Nahuatl and Hawaiian on the first list as representatives of western hemisphere indigenous languages. I think Mexicoâs cultural diversity is often overlooked by outsiders, especially people like me who grew up with burritos and fajitas in a certain border state.
Zulu 28, Xhosa 42 and Setswana 33 from Southern Africa. Oh and Afrikaans 16 (a young language)
From my ignorance I cannot pick up other African languages, but most are from Europe (and cosmopolitan English ((French and Spanish too?)) is everywhere, âEsperantoâ on iNat)
I would guess some of the New lexicons are African?
I have a copypasta - but seldom get a chance to use it
Why do not use just the last download? There is a date_added column in the Darwin Core archives, I should 1-1-2023 t/m 31-1-2023 for calculating the activity in januari 2023âŠ
All 11 of the official languages in South Africa have lexicons on iNat, though not quite as well represented as those 4. There are also a few of the unofficial ones too.
The majority are from the Horn of Africa, Iâve added wikipedia links for the curious.
The âcreatedâ column is useful for some queries. I mainly wanted to see total change per lexicon, also which (if any) lexicons were losing names.
Names âtransferredâ out of a âbadâ lexicon into a âgoodâ one will still have the original creation date. New copies of existing common names are re-created on the output taxa as part of taxon changes, there are about 1000 of these names created during January.
Messy, yes! Donât think there are any duplicates, should be a straightforward combination.
Hungarian up 8 places
Lithuanian up 1
Turkish up 2
Slovenian up 2
Indonesian up 1
Malay up 2
The unnamed âlexiconâ (#28 above, #2 below) is the count of names without a lexicon
Active Lexicons
Lexicons with 30 or more new names
#
Lexicon
Change
-
#
Lexicon
Change
1
Hungarian
8881
|
21
Korean
216
2
4604
|
22
Polish
200
3
Chinese (Simplified)
3303
|
23
Turkish
170
4
English
2465
|
24
Cherokee
149
5
French
2078
|
25
Swahili
148
6
Spanish
1425
|
26
Danish
141
7
Malay
1318
|
27
Greek
139
8
Slovenian
994
|
28
Lezghian
134
9
Chinese (Traditional)
830
|
29
Aotearoa (New Zealand) Bilingual Maori And English.
Theyâre the same thing. I thought there was an AOU line in the cleanup script which was run about a week ago, so was surprised to see they hadnât been merged. Iâll need to have another closer look.
In another topic started by Marina is a link to a file with all lexicons of 1-1-2023 but you can also download de DWCA or check the list with lexcions when adding a common name.