Skip to content

Data corrections (Gender) #34

@atiro

Description

@atiro

I'm writing an article on use of Pandas for data analysis in museum collections data, and have been making great use of yours and MoMa's datasets for this, for which many thanks.

One of the things I wanted to use Pandas for was find errata in the data, comparing common fields between the datasets. After a bit of data mangling, the following differences in the artist gender were flagged up.

Gender incorrect (according to MoMa)

Artist Tate Gender MoMa Gender
Misch Kohn Female Male

Gender unknown (but known by MoMa)

Artist Tate Gender MoMa Gender
Shusaku Arakawa NaN Male
Juan Downey NaN Male
Jack Goldstein NaN Male
Lawrence Abu Hamdan NaN Male
Vladimir Kozlinskii NaN Male
Vladimir Mayakovsky NaN Male
Melik Ohanian NaN Male
R. H. Quaytman NaN Female
Monika Sosnowska NaN Female

Likewise have differences for artists birth & death dates, will lodge these as separate issues if of use.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions