Fig. 2

Data sets (grey rectangles) and methods used to derive them (black arrows). The unmodified raw data include digitised language polygons in GeoJSON format and attributes with links to Glottolog in CSV format. The enriched and aggregated data include contemporary and traditional language polygons enriched with Glottocodes at increasing levels of aggregation: features, language areas, and family areas.