September 23, 2015
acq_no | artist_1 | artist_2 | date | medium | tags |
---|---|---|---|---|---|
1999.32 | Studio of Rembrandt | possibly Govaert Flinck | after 1636 | oil on canvas | religious, portrait |
1908.54 | Jan Vermeer | NA | c. 1650 | oil on panel | genre, woman |
1955.32 | Vermeer, Jan | NA | c. 1655 | oil on canvas | woman, window, painting |
1955.33 | Hals, Frans | NA | 16220 | oil on canvas, relined | merry company |
acq_no | artist_1 | artist_2 | date | medium | tags |
---|---|---|---|---|---|
1999.32 | Studio of Rembrandt | possibly Govaert Flinck | after 1636 | oil on canvas | religious, portrait |
1908.54 | Jan Vermeer | NA | c. 1650 | oil on panel | genre, woman |
1955.32 | Vermeer, Jan | NA | c. 1655 | oil on canvas | woman, window, painting |
1955.33 | Hals, Frans | NA | 16220 | oil on canvas, relined | merry company |
artist_1
: first and last names combined; conditionalsdate
: conditionalsmedium
: medium and support combined; conditionalsacq_no | artist_1 | artist_2 | date | medium | tags |
---|---|---|---|---|---|
1999.32 | Studio of Rembrandt | possibly Govaert Flinck | after 1636 | oil on canvas | religious, portrait |
1908.54 | Jan Vermeer | NA | c. 1650 | oil on panel | genre, woman |
1955.32 | Vermeer, Jan | NA | c. 1655 | oil on canvas | woman, window, painting |
1955.33 | Hals, Frans | NA | 16220 | oil on canvas, relined | merry company |
tags
and artists
need to be their own tablesacq_no | artist_1 | artist_2 | date | medium | tags |
---|---|---|---|---|---|
1999.32 | Studio of Rembrandt | possibly Govaert Flinck | after 1636 | oil on canvas | religious, portrait |
1908.54 | Jan Vermeer | NA | c. 1650 | oil on panel | genre, woman |
1955.32 | Vermeer, Jan | NA | c. 1655 | oil on canvas | woman, window, painting |
1955.33 | Hals, Frans | NA | 16220 | oil on canvas, relined | merry company |
date
has typosdate
could potentially be numericThree conceptual tables: objects, artist links, and tags.
acq_no | date | date_qual | medium | support | cons_note |
---|---|---|---|---|---|
1999.32 | 1636 | after | oil | canvas | NA |
1908.54 | 1650 | circa | oil | panel | NA |
1955.32 | 1655 | circa | oil | canvas | NA |
1955.33 | 1620 | NA | oil | canvas | relined |
acq_no | name | qualification |
---|---|---|
1999.32 | Rembrandt | studio of |
1999.32 | Govaert Flinck | possibly |
1908.54 | Jan Vermeer | NA |
1955.32 | Jan Vermeer | NA |
1955.33 | Frans Hals | NA |
acq_no | tag |
---|---|
1999.32 | religious |
1999.32 | portrait |
1908.54 | genre |
1908.54 | woman |
1955.32 | woman |
1955.32 | window |
1955.32 | painting |
1955.33 | merry_company |
By separating out tables, we can be more flexible with the types of questions we can ask.
# What paintings have the tag "woman"? clean_data_obj %>% inner_join(clean_data_tags, by = "acq_no") %>% filter(tag == "woman") %>% kable()
acq_no | date | date_qual | medium | support | cons_note | tag |
---|---|---|---|---|---|---|
1908.54 | 1650 | circa | oil | panel | NA | woman |
1955.32 | 1655 | circa | oil | canvas | NA | woman |
# When was each painter working? clean_data_obj %>% inner_join(clean_data_artist, by = "acq_no") %>% filter(is.na(qualification) | qualification != "studio of") %>% ggplot(aes(x = name, y = date, color = date_qual)) + geom_point(size = 5)
1701-01-01
, or have separate columns for year, month, day.start_date
and end_date
citation
) then you probably need to make a new column just for that information.[?]
is not informative. What about: illegible
, unsubstantiated
, approximate
?date
, date_uncertainty
Omeka and the like are great for interactively communicating stories and selections from your research. They are not a storage and dissemination solution in the long run. Costly, fragile, limiting.
"Flat files" are decoupled from running software, and can be opened by a regular text editor. Someone without your original software has a better chance of recovering the information.
.txt
, .csv
, not .xslx
)
UTF-8
so that accents & special characters are preserved