CBOLD has gathered data from a variety of sources in the past two and a half years; while we are still working on much of it, we are making it available on the WWW for preliminary evaluation.
Please let us know what you think and whether you find any of the material useful.
Approximate number of lexical items currently received:
262,006 entries, from 51 sources, containing data on 130 languages.
We have created several lists to make it easier to find out what data CBOLD has:
List of data sources by language: Some of the files contain words from several different Bantu languages; this list makes it possible to find out which file or files contain data for a particular language. Note that the name of the language is given without a prefix (i.e., look under B for Bemba, not C for Cibemba).
Search database of sources: A query form for searching the inventory of sources.
as of 11/1/96 jbl