I periodically do an audit of the bibliography to gain an understanding of the extent of the coverage. Generally, I focus on 5 areas and for the most part the scope of these categories are self-evident, however there are some nuances to consider.
Type of entry refers to the type of material – journal article, book, book section, et cetera – in which the citation references.
Language of content refers to the language(s) which the referenced document is written. In some cases a document may be written in one or more languages.
Year of publication is pretty self-evident, this refers to the year in which the document was published. In some cases the date of publication is unknown or unidentified so this field is left empty.
Available or unavailable in digital form refers to whether the document is accessible digitally from a reputable and reliable source. This includes content that is freely available as well as content available behind a paywall.
Finally, tagged or untagged refers to whether I have completed a basic level of cataloging of the content. As I add content to the database I attempt to assign relevant tags – this has been a challenge to maintain or sustain.
As of May 21, 2023 – 43,249 unique entries
Type of entry
Type | Quantity | Percent |
---|---|---|
Book | 3,641 | 8.42 % |
Book section | 2,758 | 6.38 % |
Journal or magazine article | 33,465 | 77.38 % |
Manuscript of archival collection | 424 | 0.98 % |
Newspaper article | 1,109 | 2.56 % |
Thesis or dissertation | 1,307 | 3.02 % |
Other | 545 | 1.26 % |
“Other” is a combination of types which individually make up less than 0.5% of the entire dataset. These types include: audio recording, blog post, conference paper, dataset, dictionary entry, document, encyclopedia article, interview, patent, presentation, report, video recording, and web page.
Language of content
Language | Quantity | Percent |
---|---|---|
Dutch | 3,332 | 7.70 % |
English | 26,695 | 61.72 % |
French | 570 | 1.32 % |
German | 2,801 | 6.48 % |
Hungarian | 298 | 0.69 % |
Italian | 6,300 | 14.57 % |
Japanese | 890 | 2.06 % |
Korean | 352 | 0.81 % |
Spanish | 545 | 1.26 % |
Swedish | 230 | 0.53 % |
Other | 1,328 | 3.07 % |
“Other” is a combination of languages which individually make up less than 0.5% of the entire dataset. These languages include: Afrikaans, Arabic, Bosnian, Catalan, Chinese, Croatian, Czech, Danish, Estonian, Finnish, Galician, Greek, Gujarati, Hindi, Icelandic, Indonesian, Latin, Latvian, Lithuanian, Malay, Marathi, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Tamil, Thai, Turkish, Ukrainian, Urdu, Uzbek, Vietnamese, and unidentified. Some of the referenced materials include more than one language.
Year of publication

Accurate as of May 21, 2023.
Available / unavailable online
Available | 15,000 | 34.68 % |
Unavailable | 28,249 | 65.32 % |
Tagged or untagged
With keywords | 16,081 | 37.18 % |
Without keywords | 27,168 | 62.82 % |