The idea

We selected our idea from the Wikipedia page on the Battle of Waterloo. It involves Napoleon Bonaparte as the person, Waterloo as the place, defeat as the concept and 1815 as the date.

E-R diagram

In this model, entities are represented as rectangles and relationships as rhombuses.

img/er.jpg

The E-R diagram of our idea.

The set of items

This section lists the items that will be described, following the LAM guidelines, in the sections below. It is important to note that many of the retrieved items are provided without metadata.

  1. Redingote et bicorne de Napoléon Ier, Artifact [Musée de l'Armée]

  2. Letter of surrender from Napoleon to the Prince Regent, 13 July 1815, Text [Royal Collection Trust]
  3. The Emperor Napoleon in His Study at the Tuileries, Painting [Google Arts & Culture / National Gallery of Art]

  4. Plan de la bataille de Waterloo ou mont St-Jean. 18 juin 1815, Artifact [Gallica]

  5. The Field of Waterloo, Painting [Google Arts & Culture / British Museum]

  6. Cuirasse de carabinier Fauveau, Artifact [Musée de l'Armée]

  7. Carte di Pasotti Francesco, Collection of texts [Archivio storico di Imola]

  8. Europe after the Congress of Vienna, Artifact [Library of Congress]

  9. Tagebuch vom Wiener Kongreß, Text [Deutsche Nationalbibliothek]

  10. The 28th Regiment at Quatre Bras, Painting [Google Arts & Culture / National Gallery of Victoria]

The standard and alignment

The alignment below maps the different metadata standards that cultural heritage institutions use to describe people, places, dates and concepts.

img/alignement.png

Alignment of the metadata used to describe the found items.

Theoretical model

We develop our theoretical model by starting from categories and asking questions about them. The answers are then documented with suitable properties from suitable ontologies in the conceptual model.

Who (people)

Here the main problem is to declare explicitly the different roles of an entity that can itself be a single person, a family or a corporate body. Moreover, the same entity can be referred to as "the creator", "the user" or "the owner" of an item. For these reasons we considered EAC-CPF and FOAF for our purposes.

We started with FOAF, the Friend of a Friend ontology, which offers useful properties for describing real people (e.g. Napoleon Bonaparte) and general information about them. It answers basic questions such as "What is the gender?" and others that aim at a person's personal information. In our case, however, we need information about entities in general, not only real persons, so we chose not to use FOAF for this purpose.

EAC-CPF, which comes from the archival context, is very useful for differentiating the three main types of entity (person, family and corporate body), but the most important thing is the role that an entity has in relation to the item. With EAC-CPF we can fill this gap and record this useful information (e.g. the questions here are "What is the role of this person in relation to the item?" or "Is this person the owner of the item? If so, what kind of owner?"). We therefore use EAC-CPF.

Where (places)

In our model the idea of a place is quite simple. For our purposes we only need some basic information about a place: a name (or alternative names, e.g. Waterloo in French, Waterlô in Walloon), a country (represented as a country code, e.g. IT for Italy, FR for France) and finally a latitude and longitude pair. All of these needs can be handled with the GeoNames Ontology. We chose GeoNames mainly because it is a de facto standard.

When (dates)

As far as dates are concerned, the first question we asked ourselves is: "In which format do we need to express the notion of time?". From this we moved to another question: "Do we need the notion of time, or just the notion of date?".

Because our idea falls under a historical theme, we do not care about the time of day, which often cannot even be found on the web (e.g. we cannot answer the question "At what time in the morning did the Battle of Waterloo start?"), so our focus is only on dates. Our reference is the W3C note on Date and Time Formats, a profile of ISO 8601 whose complete-date form is YYYY-MM-DD; in our descriptions we write dates in the day-first form DD/MM/YYYY customary in Italy.

What (subject/concept)

The first question we raised is "What do we want to describe?", and we thought that the best way to answer it is to describe the concept related to the item. Our model therefore has to describe both the item and the subject or concept it expresses. We raised several questions on this matter: "What is the name of the concept?", "What type of entity is this?", "What is the subject or theme of the item?" and "To which category does the concept belong?" (e.g. General belongsToCategory Army).

Some of these questions are general and are fully handled by the Dublin Core Metadata Initiative vocabularies and RDF(S). We rely on Dublin Core because it is widely used, its properties are very general and it offers a high level of abstraction. SKOS is a standard that makes it easy to define exactly what a concept is, so we adopted it to give the right definition of a concept. We use CIDOC-CRM to describe, in a general way, the people linked to the concept. FOAF, finally, lets us express how a concept finds its material representation (following the approach of DBpedia).

Conceptual model

In this section we present the conceptual model, which has to answer the previous questions and then wrap the data. For each class that can be described, we define the appropriate properties from the chosen ontologies and collect them in the following tables.

Person

As already described in the theoretical model, we use EAC-CPF to answer the questions about people. The properties below use the eac-cpf namespace defined here. One note: to add more information about the type of a relation we also use the description property from Dublin Core (and properties defined by DCTERMS).

xmlns:eac-cpf="http://archivi.ibc.regione.emilia-romagna.it/ontology/eac-cpf#" 
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:dcterms="http://purl.org/dc/terms/"

img/who(final).png

This image shows the conceptual model we defined to answer the questions. For each question it indicates the property that best answers it.

In the following list, the meaning of each property is described:

  1. eac-cpf:entity defines exactly what we are describing (corporate body, person or family). This makes our model adaptable to all situations.

  2. eac-cpf:nameEntry simply defines the name of the entity.
  3. eac-cpf:chronItem (eac-cpf:date, eac-cpf:place, eac-cpf:event) can describe the chronological extremes of the entity, but also particular moments of its existence. We preferred it to the BIO ontology, because BIO is closely tied to the description of individuals (e.g. a corporate body has a foundation date, not exactly a birthday).
  4. eac-cpf:biogHist holds a brief description of the entity.
  5. eac-cpf:resourceRelation (eac-cpf:resourceRelationType, dc:description, dcterms:relation, dc:date) describes a relationship between the entity and a resource. We wanted to offer the possibility of adding historical details to the relationship through the property dc:description.
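
The property list above can be sketched as a small data structure. This is only an illustration in Python, not the RDF/XML used in the project: the class and field names are hypothetical, the comments map each field to the property it stands for, and the relation type value "creatorOf" is an assumed example value.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Entity types named in the text: person, family or corporate body.
ENTITY_TYPES = {"person", "family", "corporateBody"}


@dataclass
class ChronItem:
    """A dated moment in the life of an entity (eac-cpf:chronItem)."""
    date: str    # eac-cpf:date, written as DD/MM/YYYY
    place: str   # eac-cpf:place
    event: str   # eac-cpf:event


@dataclass
class ResourceRelation:
    """Relation between an entity and an item (eac-cpf:resourceRelation)."""
    relation_type: str          # eac-cpf:resourceRelationType (assumed value below)
    resource: str               # dcterms:relation, the related item
    description: str = ""       # dc:description, optional historical detail
    date: Optional[str] = None  # dc:date, written as DD/MM/YYYY


@dataclass
class Entity:
    """One described entity; hypothetical field names mirroring the list above."""
    entity_type: str            # eac-cpf:entity
    name_entry: str             # eac-cpf:nameEntry
    biog_hist: str = ""         # eac-cpf:biogHist
    chron_items: List[ChronItem] = field(default_factory=list)
    relations: List[ResourceRelation] = field(default_factory=list)

    def __post_init__(self) -> None:
        # Reject anything that is not one of the three entity types.
        if self.entity_type not in ENTITY_TYPES:
            raise ValueError(f"unknown entity type: {self.entity_type!r}")


napoleon = Entity(entity_type="person", name_entry="Napoleon Bonaparte")
napoleon.chron_items.append(
    ChronItem(date="18/06/1815", place="Waterloo", event="Battle of Waterloo")
)
napoleon.relations.append(
    ResourceRelation(
        relation_type="creatorOf",
        resource="Letter of surrender from Napoleon to the Prince Regent",
        date="13/07/1815",
    )
)
print(napoleon.name_entry, "->", napoleon.relations[0].resource)
```

Keeping the relation as its own record is what lets the same entity appear as creator of one item and owner of another, which is the role problem raised in the theoretical model.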

Date

Dates are written following the choice made in the theoretical model. The W3C note on Date and Time Formats specifies YYYY-MM-DD; for our case we opted for the day-first form DD/MM/YYYY, which is the customary format in Central and Southern Europe, and in particular in Italy.
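
As a minimal sketch of how such dates could be handled in practice, the Python snippet below parses a DD/MM/YYYY string and converts it to and from the ISO 8601 form YYYY-MM-DD, which may be convenient when exchanging data with other systems. The function names are hypothetical and only the standard library is assumed.

```python
from datetime import date, datetime


def parse_day_first(text: str) -> date:
    """Parse a DD/MM/YYYY string; raises ValueError on malformed input."""
    return datetime.strptime(text, "%d/%m/%Y").date()


def format_day_first(value: date) -> str:
    """Write a date back in the DD/MM/YYYY form used in our descriptions."""
    return f"{value.day:02d}/{value.month:02d}/{value.year:04d}"


battle = parse_day_first("18/06/1815")
print(battle.isoformat())        # 1815-06-18
print(format_day_first(battle))  # 18/06/1815
```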

Place

We chose to describe places with the GeoNames Ontology and the Basic Geo (WGS84 lat/long) Vocabulary. To answer the questions we define the following namespaces.

xmlns:gn="http://www.geonames.org/ontology#"
xmlns:wgs84_pos="http://www.w3.org/2003/01/geo/wgs84_pos#"

| Question | Property |
|---|---|
| What is the name? | gn:name |
| Are there other names used to describe the place? | gn:alternateName [optional] |
| How can we explicitly define the country? | gn:countryCode |
| What is the latitude? | wgs84_pos:lat |
| What is the longitude? | wgs84_pos:long |

Table of questions and answers. The answers are the properties of the used vocabularies.
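
To make the table concrete, here is a hedged sketch that builds an RDF/XML description of a place with Python's standard xml.etree.ElementTree module. The function name, the example.org URI and the rounded coordinates are assumptions for illustration only; real values should be taken from GeoNames. The GeoNames ontology spells the alternate-name property gn:alternateName, which is the form used here.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
GN = "http://www.geonames.org/ontology#"
WGS84 = "http://www.w3.org/2003/01/geo/wgs84_pos#"

# Use the same prefixes as the namespace declarations in the text.
for prefix, uri in (("rdf", RDF), ("gn", GN), ("wgs84_pos", WGS84)):
    ET.register_namespace(prefix, uri)


def place_to_rdfxml(uri, name, alternate_names, country_code, lat, lon):
    """Serialize one place as RDF/XML, one child element per table row."""
    root = ET.Element(f"{{{RDF}}}RDF")
    feature = ET.SubElement(root, f"{{{GN}}}Feature", {f"{{{RDF}}}about": uri})
    ET.SubElement(feature, f"{{{GN}}}name").text = name
    for alt in alternate_names:  # optional, may be empty
        ET.SubElement(feature, f"{{{GN}}}alternateName").text = alt
    ET.SubElement(feature, f"{{{GN}}}countryCode").text = country_code
    ET.SubElement(feature, f"{{{WGS84}}}lat").text = str(lat)
    ET.SubElement(feature, f"{{{WGS84}}}long").text = str(lon)
    return ET.tostring(root, encoding="unicode")


waterloo_xml = place_to_rdfxml(
    uri="http://example.org/place/waterloo",  # hypothetical URI
    name="Waterloo",
    alternate_names=["Waterlô"],
    country_code="BE",
    lat=50.71,  # approximate, illustrative value
    lon=4.40,   # approximate, illustrative value
)
print(waterloo_xml)
```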

Subject/concept

To answer the questions raised during the development of the theoretical model we use the DC, RDF(S), CIDOC-CRM and FOAF ontologies, with the namespaces defined as follows:

xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" 
xmlns:crm="http://www.cidoc-crm.org/cidoc-crm/" 
xmlns:dc="http://purl.org/dc/elements/1.1/" 
xmlns:foaf="http://xmlns.com/foaf/0.1/"

img/what(final).png

How our conceptual model aims to describe the subject/concept.

Describe items using our model

In this section we describe some items in natural language, following our model. All the needed files are in a shared Google Drive folder.

| Item | Person | Subject/Concept | Place |
|---|---|---|---|
| Letter of surrender | Napoleon | Surrender | Rochefort |
| Chapeau | Napoleon | General | Waterloo |
| Armour | Fauveau | Killed | Waterloo |

The items in this table are described under three categories: person, subject/concept and place.

Describe an entity using RDF

We created URIs for three different entities and described each of them in RDF/XML, one of the possible serializations of RDF:

  1. PERSON: Napoleone Bonaparte (RDF/XML), described with EAC-CPF;

  2. PLACE: Rochefort (RDF/XML), described with GeoNames;
  3. CONCEPT/SUBJECT: Surrender of Napoleon (RDF/XML), described with CIDOC-CRM and DC.

Connecting data to other related items

TODO

References