Eric, you and I are facing the same situations in both spiders and moth taxonomy. Different looking moths named as separate species even though the genitalia don’t differ consistently, OR species distinguished by genitalic differences even though the moths are essentially identical. The examples are innumerable. And then along comes barcoding. Sheez…
The major problem with any of these taxonomic frameworks (phenotypic, genitalic, genetic differentiation) is the lack of any objective definition of a species concept.
Going back several decades, the prevailing idea was the “biological species concept” of Ernst Mayr and others that there had to be some barrier to (successful) interbreeding for two populations to be considered separate species. The array of alternative species concepts has multiplied in the intervening years. There are volumes written about this and I won’t go into them here.
Fundamentally, there is nothing objective about the concept that “if the genitalia differ recognizably and consistently, then the two populations are separate species”. The implication is that if the genitalia differ, that somehow must impose a barrier to successful interbreeding. But how do the spiders or moths respond to those differences? What level of genitalic differences will impede their breeding efforts? To my knowledge, no one has ever defined this, much less attempted to investigate it for any group of organisms or set of species. As a result, at times the descriptions of genitalic differences between two populations of moths (or substitute any other type of organism) read like a parody of the whole topic when an author describes minor differences in the shape of the valvae or uncus of two different moths and declares them separate species.
As for genotypic differentiation, the 2% barcode rule drives me crazy. It is arbitrary and the foundation of it is extremely thin. As I’ve read back in the literature, one broad-based study (on butterflies? if I am remembering correctly) found that the 2% barcode difference correctly predicted the post hoc species ID on the order of 95-98%. Fine. But there was variability in that statistic and what about the 2 to 5% which failed to be classified properly? There is no objective “species concept” in this. Virtually all subsequent barcode-based taxonomic work is founded upon just the one or two original studies which settled on the 2% level of differentiation. It’s a house of cards.
Even with large volume genomic sequencing now being employed, while we may have more confidence in genetic differentiation, I’m still finding a lack of objective definitions of a species concept. Recent papers often offer a very circular logic: “These clades [in a phylogenetic tree] separate out at the level commonly regarded as species-level differentiation, therefore we’ll name these as separate species.”
An example of a study that I recite frequently which attempts to do taxonomy in a comprehensive manner is that of Cong et al. (2020) regarding the North American Junonia (buckeye) butterflies.* They really try to take in the whole biology of the set of populations under study. The outcome is a quagmire but I applaud their efforts!
* Cong, Q., et al. 2020. Speciation in North American Junonia from a genomic perspective. Systematic Entomology (2020), DOI: 10.1111/syen.12428.