This dataset was developed with the aid of #computational tools to address some major shortcomings of previous databases, namely #coverage, #comparability, and #extensibility. The 1,942 languages represent 155 language families and 78 isolates, and more than 1,850 of 2/5