Skip to content

Commit 946ae8b

Browse files
committed
Add workflow files for Apr 4 '22 data set
1 parent ee28135 commit 946ae8b

3 files changed

Lines changed: 2163 additions & 0 deletions

File tree

‎workflow/2022-04-04-README.md‎

Lines changed: 41 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,41 @@
1+
# Combined prototype data 2022-04-04
2+
3+
Second set of prototype data for ingest; represent first full iteration of
4+
extraction workflow.
5+
6+
## List of institutions
7+
8+
- Q1079140, Indiana University, Bloomington; partial data, prototype import
9+
- Q1093910, City College of New York; partial data, prototype import
10+
- Q1501676, General Theological Seminary; partial data, prototype import
11+
- Q168756, University of California, Berkeley; partial data, prototype import
12+
- Q1976985, Nelson-Atkins Museum of Art; partial data, prototype import
13+
- Q20745482, Providence Public Library; partial data, prototype import
14+
- Q21578, Princeton University; partial data, prototype import
15+
- Q30257935, Conception Abbey and Seminary; partial data, prototype import
16+
- Q3087288, Free Library of Philadelphia; partial data, prototype import
17+
- Q49088, Columbia University; partial data, prototype import
18+
- Q49115, NIC; partial data, prototype import
19+
- Q49117, University of Pennsylvania; partial data, prototype import
20+
- Q49205, Wellesley College; partial data, prototype import
21+
- Q49210, New York University; partial data, prototype import
22+
- Q499451, Rutgers, The State University of New Jersey; partial data, prototype import
23+
- Q5021042, State of California; partial data, prototype import
24+
- Q5174002, Grolier Club; partial data, prototype import
25+
- Q52413, University of Kansas; partial data, prototype import
26+
- Q63969940, Burke Library, Columbia University; partial data, prototype import
27+
- Q766145, University of Oregon; partial data, prototype import
28+
29+
## Extraction notes
30+
31+
Adds Kansas. Also adds more extraction of several column types not in previous
32+
version; notably genre columns are split based on source vocabulary (LC terms,
33+
AAT, etc.)
34+
35+
## Enrichment notes
36+
37+
Full enrichment process represented.
38+
39+
## Import notes
40+
41+
Imported April 2022.

0 commit comments

Comments
 (0)