Skip to content

Commit c7ffa36

Browse files
committed
2 parents 61a212d + 66c1d2b commit c7ffa36

1 file changed

Lines changed: 22 additions & 1 deletion

File tree

‎README.md‎

Lines changed: 22 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1,22 @@
1-
# ds-xml
1+
# DS XML
2+
3+
XML exports from https://catalog.digital-scriptorium.org
4+
5+
These files are split to avoid GitHub requiring [`git-lfs`](https://docs.github.com/en/repositories/working-with-files/managing-large-files/about-git-large-file-storage) for files over 50MB.
6+
7+
The files are gzipped and split using the `split` command:
8+
9+
```
10+
$ gzip export.xml
11+
# => export.xml.gz
12+
# split files into 8MB chunks:
13+
$ spit -b 8m export.xml.gz export.xml.gz_
14+
# => export.xml.gz_aa, export.xml.gz_ab, export.xml.gz_ac, export.xml.gz_ad, ...
15+
```
16+
17+
18+
To reassemble and decompress the files use the `cat` command:
19+
20+
```
21+
$ cat export.xml.gz_* | gunzip -c > export.xml
22+
```

0 commit comments

Comments
 (0)