Skip to content

Commit 47125d4

Browse files
committed
upload training datasets
1 parent 33b1106 commit 47125d4

5 files changed

Lines changed: 385800 additions & 0 deletions
Lines changed: 342 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,342 @@
1+
language_as_recorded,language_code,authorized_label,structured_value,ds_qid
2+
Text in Sinhala,,,,
3+
Text in Latin,,,,
4+
Text in Latin and German,,,,
5+
Text in Greek and Latin,,,,
6+
Text in Ancient Greek,,,,
7+
In Spanish,,,,
8+
Text in Italian,,,,
9+
arm,,,,
10+
sin,,,,
11+
Text in Greek,,,,
12+
ger,,,,
13+
lat,,,,
14+
Modern Javanese,,,,
15+
Text in Spanish,,,,
16+
Text in Armenian,,,,
17+
Text in Dutch,,,,
18+
In Latin,,,,
19+
In Pali,,,,
20+
Text in English and Latin,,,,
21+
In Geez,,,,
22+
"In Batak (Poda, Toba script). The script uses the southern ta. Two variants of the letter na occur; next to the standard form the ancient na (as Voorhoeve called it) is also occasionally used. The ancient na is typically found in Mandailing manuscripts, but this manuscript is clearly not Mandailing, but was probably written in the Toba district",,,,
23+
In Ge'ez,,,,
24+
Text in Church Slavic,,,,
25+
"In Batak (Poda, Toba script)",,,,
26+
Text in Ge'ez,,,,
27+
In French,,,,
28+
Text written in Sanskrit,,,,
29+
eng,,,,
30+
In Latin and French,,,,
31+
Text written in Tibetan,,,,
32+
Text in Latin and Spanish,,,,
33+
In Latin and Spanish,,,,
34+
syr,,,,
35+
In Chinese and Japanese,,,,
36+
jpn,,,,
37+
In Arabic with rubrics and marginal text in Persian,,,,
38+
Text in Arabic,,,,
39+
Text in Pali and Sinhala,,,,
40+
Text in Latin and Italian,,,,
41+
Text in German,,,,
42+
In Batak (Poda),,,,
43+
Text in Latin and English,,,,
44+
Text in French,,,,
45+
In Italian,,,,
46+
Low-German dialect,,,,
47+
fre,,,,
48+
"In Dutch, three documents in French",,,,
49+
Text in English,,,,
50+
ita,,,,
51+
Hebrew,,,,
52+
Text in Syriac,,,,
53+
In Middle French,,,,
54+
"In Latin, with some Italian in epistolary formulas",,,,
55+
In Japanese,,,,
56+
san,,,,
57+
Written in Pali,,,,
58+
"In Persian, Tajik script",,,,
59+
Text in Pali,,,,
60+
Latin words,,,,
61+
Text in French and Latin,,,,
62+
Text in Italian and Latin,,,,
63+
Text in Latin with notes in Italian in the margins,,,,
64+
In Persian,,,,
65+
tib,,,,
66+
In Greek,,,,
67+
"In Kanbun, without typical Japanese reading marks",,,,
68+
In Syriac,,,,
69+
Italian,,,,
70+
Text in Latin; translated from Italian,,,,
71+
Text in Greek and Church Slavonic,,,,
72+
Text in Latin and French,,,,
73+
per,,,,
74+
In German and Latin,,,,
75+
Latin,,,,
76+
In Hebrew,,,,
77+
Text in Greek and Arabic,,,,
78+
In Spanish and Latin,,,,
79+
Text written in Tamil,,,,
80+
gre,,,,
81+
In Latin; fol. 68v in English,,,,
82+
"Text in English, translated from French",,,,
83+
Text in Bengali and Sanskrit,,,,
84+
Greek,,,,
85+
Text in Latin and with marginalia in Latin and Greek,,,,
86+
"Text chiefly in Latin, with prayer to the Holy Trinity (fol. 92) in French",,,,
87+
heb,,,,
88+
und,,,,
89+
In German,,,,
90+
Text in multiple languages,,,,
91+
Text written in Sinhalese script,,,,
92+
"Text in Latin, French, Anglo-Norman French, and English",,,,
93+
Text in German and some Latin,,,,
94+
Texts in Latin,,,,
95+
In English,,,,
96+
Latin and German,,,,
97+
Arabic,,,,
98+
Chiefly in Latin with some Italian,,,,
99+
"Middle High German, Rhenish (Middle Franconian) dialect",,,,
100+
"Latin, with a few documents in Italian",,,,
101+
In Italian and Latin,,,,
102+
Latin and Italian,,,,
103+
"Latin, with one leaf in Italian (f. 2r)",,,,
104+
German,,,,
105+
Italian and Latin,,,,
106+
Latin and Low German,,,,
107+
Hebrew (vocalized),,,,
108+
"Latin, with some words in German (f. 54r, 55r, 89r)",,,,
109+
In Dutch,,,,
110+
"In Latin, with a few words in Italian",,,,
111+
German; liturgical text on covers in Latin,,,,
112+
"Latin, with short passages in German",,,,
113+
"In Latin, with later summaries in French (verso)",,,,
114+
Middle French,,,,
115+
In Latin and Italian,,,,
116+
"Italian, in Tuscan dialect",,,,
117+
"In Coptic, with a few marginal notations of later date in Arabic and German",,,,
118+
"Some items in Latin, some in Italian",,,,
119+
"In Latin, with German words (f. 2r-51r)",,,,
120+
"Italian, probably Tuscan, with some prayers in Latin",,,,
121+
Spanish,,,,
122+
"Latin, with a few headings in French",,,,
123+
"Latin, with some French translations in the first conjugation table (f. 8r-9r)",,,,
124+
"Latin, with a few notes in Italian",,,,
125+
Latin and French,,,,
126+
"German, with some French on two leaves (f. 153-154)",,,,
127+
"Latin, with one section (f. 267r-272r) and some recipes, notes, and names in Italian",,,,
128+
Persian,,,,
129+
"Latin, with calendar and accessory texts in French",,,,
130+
"In Italian, with a few words in Latin",,,,
131+
"In Italian, with notarial signatures in Latin",,,,
132+
Written in Latin with a small amount of Italian (f. 91r-94v),,,,
133+
"Latin, with occasional paragraphs in Italian",,,,
134+
"Middle High German, with some Latin",,,,
135+
"Italian, in the Venetian dialect",,,,
136+
"Latin, with calendar in French (f. 1r-12v)",,,,
137+
Provençal,,,,
138+
"Italian, with a short work in Latin (f. 26r-30v)",,,,
139+
"In Latin, with some documents in Italian",,,,
140+
Old French,,,,
141+
Italian; possibly in the Florentine vernacular,,,,
142+
"Latin, with some rubrics and directions in French",,,,
143+
"Catalan, with prologue in Latin",,,,
144+
French,,,,
145+
Persian and Arabic,,,,
146+
"Latin, with occasional words in Greek",,,,
147+
Hebrew; introduction in Italian,,,,
148+
"German, with Latin phrases",,,,
149+
"Italian, with Greek on flyleaves",,,,
150+
Italian with a short section in Latin (f. 49r-49v),,,,
151+
"In German, with Latin phrase",,,,
152+
"Latin, with a poem in Middle English",,,,
153+
"In Italian, with first paragraph and notarial signature in Latin",,,,
154+
"Latin, with some passages in German (f. 10v, 20v-21r, 40v-43v, 83v-84v, 91r-93v) and Czech (f. 11v, 26v-28v)",,,,
155+
"Latin, with one inscription in Hebrew (f. 20r)",,,,
156+
Added notes in Italian (Volume 1),,,,
157+
"Latin, with annotations in Latin and Greek",,,,
158+
"Latin, with some Italian (f. 5r-7v, 13r-15v, 46r-51v) and Spanish (f. 9r-11v)",,,,
159+
Italian and Spanish,,,,
160+
"Hebrew, with occasional notes in Italian (f. 22r, 24v, 26v, 27v, 82v, 83v)",,,,
161+
"Latin, with some Italian",,,,
162+
"Latin with occasional words in Greek (f. 21r, 134v)",,,,
163+
"Middle French, with verses in Old French",,,,
164+
Ancient Greek,,,,
165+
"Italian, with some Latin, most frequently for the openings and closings of documents",,,,
166+
Low German,,,,
167+
"Latin, with a few words in Middle English",,,,
168+
"Italian, with a few headings, etc. in Latin",,,,
169+
"Latin, with two recipes in Italian",,,,
170+
"French, with citations and margin notes in Latin",,,,
171+
"Latin, with words transcribed from Arabic",,,,
172+
"Middle English, with some Latin notations",,,,
173+
Ottoman Turkish with some Arabic,,,,
174+
"Italian (Venetian dialect), with three recipes in Latin (f. 17r)",,,,
175+
"Italian, with a preface in Latin",,,,
176+
"In Italian, with some letters to Donato Acciaiuoli in Latin",,,,
177+
"Middle French, with some sections in Latin (f. 77r-81v, 85v-87r, 93r-98r, 137v, 141r)",,,,
178+
"In Italian, Latin, and Spanish",,,,
179+
"Latin, with occasional Greek terms. Blank spaces left for other Greek terms",,,,
180+
Hebrew and Latin,,,,
181+
Aramaic,,,,
182+
"Latin, with a few words in Spanish (f. 7r)",,,,
183+
"Italian, with a few items in Latin",,,,
184+
"Latin, with one work in German (f. 167r-167v)",,,,
185+
"Latin, with Middle French notations in margins",,,,
186+
"Italian, in the Venetian dialect; list of indulgences in Latin (f. 56r-61v)",,,,
187+
"Italian, with colophon in Latin",,,,
188+
"Latin, with Low German prayers and interlinear translations interspersed throughout",,,,
189+
"English, with some notes in Latin and occasional words in Greek",,,,
190+
"Italian, with first item and some titles in Latin",,,,
191+
Dutch,,,,
192+
"Latin, with some Greek",,,,
193+
"Italian, with some Latin",,,,
194+
"Latin, with weekday devotions in Italian (f. 129-133)",,,,
195+
German with Latin calendar and colophon,,,,
196+
"Latin, with a few words in Greek in notes and passages in Greek in printed text",,,,
197+
Middle French and Latin,,,,
198+
"Mostly in German, with significant sections of the text in Latin. Some notations on the front and back end-leaves (dating from later than the text itself) are in Latin",,,,
199+
"Spanish, with some Latin",,,,
200+
Latin and Spanish,,,,
201+
"In Italian, with a few lines in Latin",,,,
202+
"In Latin, with a few words in Middle English",,,,
203+
Middle English and Latin,,,,
204+
"Latin (f. 1r-56v), with some Spanish (f. 78r-169v) and sections of German",,,,
205+
"Italian, with one note in Latin and Italian",,,,
206+
"Italian, Latin, and Spanish",,,,
207+
Latin with Spanish notes at the end,,,,
208+
"Arabic, with some Persian",,,,
209+
"Italian, with headings and occasional marginal notes in Latin",,,,
210+
"Latin and Italian, with a few words in Greek",,,,
211+
"Latin, except for one document in Italian",,,,
212+
Ethiopic,,,,
213+
"Latin, with a few notes in German",,,,
214+
"Latin, with words and names in Ancient Greek and Hebrew",,,,
215+
"Italian, in the Northeastern (Venetian?) vernacular",,,,
216+
"Latin, with first item in Italian (f. 2r-68r)",,,,
217+
"Latin, with the final poem in Italian (f. 16v)",,,,
218+
"Latin, with title and final words in Greek and notes inside the covers in Greek and Italian",,,,
219+
"German, with chapter and closing rubrics in Latin",,,,
220+
Latin with rubrics in German and later additions in Latin and German,,,,
221+
"Judeo-Arabic, with a brief work in Hebrew (f. 25r-39v); notes in Samaritan (f. 1v-7v, 21r-22v, 3 bifolia laid in after f. 22v, marginal notes as on f. 281v), Arabic (f. 1r, marginal notes on Samaritan bifolia), and Latin letters (f. 20v, 25v, 238r, 239v)",,,,
222+
"Italian, with a few copies in Latin and Spanish",,,,
223+
In Latin with extensive rubrics in German,,,,
224+
Middle English,,,,
225+
"Latin, with quotations in Greek",,,,
226+
gmh,,,,
227+
"Latin, with a few sentences in Italian",,,,
228+
"Latin, with a Hebrew alphabet and a few Hebrew words (f. 95v)",,,,
229+
"Latin, with calendar (f. 1r-12v), the Fifteen joys (f. 72v-75r), and a suffrage of Saint Bernard (f. 106r-106v) in Middle French",,,,
230+
"Latin, with later additions in German",,,,
231+
"In Latin, with some Hungarian and Turkish",,,,
232+
"Latin, with table of contents and later additions in Italian",,,,
233+
"Middle English, passages in Latin",,,,
234+
cze,,,,
235+
"English, with many works translated from Latin and retaining Latin titles",,,,
236+
French and German,,,,
237+
"Provençal, Occitan (modern Provençal), and one document in Latin",,,,
238+
"Latin, with French translation",,,,
239+
"English, with a few words in Latin",,,,
240+
"Middle High German, with one prayer in Latin (f. 8r)",,,,
241+
"Greek, with Greek and Latin annotations",,,,
242+
"Latin, with a few lines in German",,,,
243+
"In Latin, with the words of the bride and groom and the note of the bride's brother in Middle English",,,,
244+
Armenian,,,,
245+
"Latin, with some lines of Greek",,,,
246+
"Latin, with many 16th- and 17th-century entries and marginalia in Italian",,,,
247+
"Latin, with some Italian passages",,,,
248+
"Italian, with short introduction in Latin",,,,
249+
"Hebrew, with one interlinear glossed word in Latin (f. 54r)",,,,
250+
"Italian, with one prayer in Latin",,,,
251+
Italian; mottoes or epitaphs in Latin,,,,
252+
"Italian, with occasional phrases in Latin",,,,
253+
"Latin, with a few poems in Italian (f. 106r-108v)",,,,
254+
"In Latin, with brief endorsements in Middle English",,,,
255+
"Latin, with 23 letters from Gregory XI in French",,,,
256+
Ladino,,,,
257+
Middle High German,,,,
258+
"Latin, with Greek word ""telos"" (f. 24v)",,,,
259+
"Italian, last item in Latin",,,,
260+
"Mostly German, with benedictions in Latin (f. 22v-23r)",,,,
261+
"Italian, with a few words in Greek",,,,
262+
"Latin, with several items in Italian",,,,
263+
"In Italian, with some Latin",,,,
264+
"Arabic, with commentary in Ottoman Turkish",,,,
265+
"Italian, with short ""Missa"" text at end in Latin",,,,
266+
"Latin, with chart of Greek letters and diphthongs (f. 11r)",,,,
267+
"Latin, with a few marginal notes in French",,,,
268+
In Latin and Middle French,,,,
269+
"Latin, with passages in Spanish, Italian, Sicilian dialect, and Catalan",,,,
270+
In Spanish with two quotations in Latin (f. 3v),,,,
271+
"Spanish, with heading in Latin",,,,
272+
enm,,,,
273+
"In English, with endorsement in Latin",,,,
274+
"In Latin, with some Italian (f. 1r)",,,,
275+
"Middle High German, with some headings and sections in Latin",,,,
276+
"In Latin, with some documents in Italian and Spanish",,,,
277+
Latin with some rubrics in Middle English,,,,
278+
"German, with some letters in Latin",,,,
279+
"In Latin, with a few words in Greek",,,,
280+
"Plant names in Latin, with a few words in Italian (f. 2r-53v); texts about Morocco in English, Spanish, and Latin (54r-97v); glossary of words in Arabic, Spanish, and Latin (f. 98r-101r)",,,,
281+
"Predominant work in Hebrew (p. 19-234), with shorter works in Judeo-Arabic",,,,
282+
"Italian, in the Tuscan dialect",,,,
283+
"Provençal (f. 1-13, 25-26) and Old French (f. 14-23)",,,,
284+
"Latin, Middle French, and some words in Middle English (f. 214v)",,,,
285+
"Latin, with frequent Greek words",,,,
286+
"Italian, with a few words in Friulian (northeastern Italian) dialect",,,,
287+
Latin with German interlinear glosses and translations,,,,
288+
In Old French,,,,
289+
Middle English; some works in Latin,,,,
290+
"German, with Latin and Hebrew",,,,
291+
Latin (f. 1r-5r) and Italian with rubrics in Latin (f. 5v-20r),,,,
292+
Latin and Middle High German,,,,
293+
Latin and German (lower Alemannic with middle German idioms),,,,
294+
"Latin, with one later page in Italian (pasted-in note following f. 66)",,,,
295+
In Catalan,,,,
296+
"In Italian, with two leaves in Latin",,,,
297+
"In Italian, with 1 document each in Latin and Spanish",,,,
298+
"Italian, some of the letters are in Spanish",,,,
299+
In Middle English,,,,
300+
"English, with a fragment from a document in Latin (f. 76-78)",,,,
301+
"Latin, with articles and prayers in Polish",,,,
302+
"Latin, with one document in French",,,,
303+
"Italian, with a few words in Latin",,,,
304+
"Latin, with notes on original cover in Italian",,,,
305+
"Latin, with a few leaves in German",,,,
306+
"Old French, last section has some Latin",,,,
307+
"Latin, with marginal notes and accounts in French",,,,
308+
"Latin, with a few short sections in Italian (p. 169-174)",,,,
309+
In Valencian Catalan,,,,
310+
"Italian and Latin, with two documents in French",,,,
311+
"Italian, with three documents in Latin (f.7-8 and f.37)",,,,
312+
"Latin (f. 1r-77r, and table of contents, f. 88v) and French (f. 77v-88v)",,,,
313+
Latin and Old French,,,,
314+
"Italian, with one letter in Spanish",,,,
315+
"Condition in Middle English, obligation in Latin, and later summary in English",,,,
316+
Ge'ez,,,,
317+
Amharic,,,,
318+
English?,,,,
319+
Arabic or Persian,,,,
320+
English,,,,
321+
ara,,,,
322+
"Ottoman Turkish, Persian and Arabic",,,,
323+
"In Arabic, with one page in Persian",,,,
324+
Arabic and Persian,,,,
325+
In Arabic and Ottoman Turkish,,,,
326+
In Arabic,,,,
327+
Ottoman Turkish and Arabic,,,,
328+
"Arabic, Persian, and Ottoman Turkish",,,,
329+
ota,,,,
330+
"In Arabic, Ottoman Turkish and Persian",,,,
331+
Arabic and Urdu,,,,
332+
Pushto,,,,
333+
Arabic and Ottoman Turkish,,,,
334+
Urdu,,,,
335+
"Arabic, Persian and Ottoman Turkish",,,,
336+
Turkmen in Arabic script,,,,
337+
urd,,,,
338+
Chagatai,,,,
339+
In Chagatai,,,,
340+
"Arabic, Ottoman Turkish and Persian",,,,
341+
spa,,,,
342+
dut,,,,

0 commit comments

Comments
 (0)