<!doctype html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>A Beginner's Guide to Metadata</title>
<meta name="description" content="A Beginner's Guide to Metadata">
<meta name="author" content="Astun Technology">
<meta name="apple-mobile-web-app-capable" content="yes">
<meta name="apple-mobile-web-app-status-bar-style" content="black-translucent">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<link rel="stylesheet" href="dist/reset.css">
<link rel="stylesheet" href="dist/reveal.css">
<link rel="stylesheet" href="dist/theme/black.css" id="theme">
<link rel="stylesheet" href="custom/astun.css">
<!-- <link rel="stylesheet" href="custom/custom.css"> -->
<!-- Theme used for syntax highlighting of code -->
<link rel="stylesheet" href="plugin/highlight/monokai.css">
</head>
<body>
<div class="reveal">
<!-- Any section element inside of this container is displayed as a slide -->
<div class="slides">
<section>
<h3 style="color:var(--astun-indigo)">A Beginner's Guide to Metadata</h3>
<p>
<small><a href="http://astuntechnology.com">Jo Cook | Astun Technology</a></small>
</p>
</section>
<section data-background-image="./images/wordcloud.svg" alt="cans" class="full" data-background-size="contain">
<h2 class="box" style="background-color:var(--astun-red)">Hello!</h2>
<p class="box" style="background-color:var(--astun-red)">Welcome to a talk by the Data Discoverability team at Astun Technology</p>
<aside class="notes">We work with GeoNetwork and other technologies to make it as easy as possible for people to create good-quality metadata without being overwhelmed by what is without a doubt a complex and technical subject. Over the next 45 minutes I'm going to outline what metadata is, why it's important, highlight some of the jargon you might come across, and set out some easy steps you can undertake to get started</aside>
</section>
<section data-background-image="./images/cans.jpg" alt="unlabelled" class="full" data-background-size="cover">
<h3 class="box" style="background-color:var(--astun-red)">What is metadata?</h3>
<img class="fragment box" style="background-color:var(--astun-red);margin-top: 50px !important" src="./images/chatgpt-simple.png"/>
<p class="attribution" style="color:white">Pixnio CC0</p>
<aside class="notes">Metadata is usually described as data about data, but that doesn't really tell you much. I asked my good friend (on the internet) ChatGPT for advice.</aside>
</section>
<section data-background-image="./images/covid-map.jpeg" alt="covid map on screen" class="full" data-background-size="cover">
<h3 class="box" style="background-color:var(--astun-orange)">Why is it important?</h3>
<img class="fragment box" style="background-color:var(--astun-orange);margin-top: 50px !important" src="./images/chatgpt-important.png"/>
<aside class="notes">So ChatGPT knows that metadata is important for making data management more effective, secure, and efficient.</aside>
<p class="attribution" style="color:white">Adobe Stock Photo</p>
</section>
<section data-background-image="./images/mashups.png" alt="granny mashup" class="full" data-background-size="800px 600px" data-background-position="bottom 60px">
<h3 class="box" style="background-color:var(--astun-teal);margin-top: 50px !important">But it also...</h3>
<p class="fragment box-left fade-in-then-out" style="background-color:var(--astun-teal);margin-top: 50px !important">Reduces risk and cost</p>
<p class="fragment box-left fade-in-then-out" style="background-color:var(--astun-teal);margin-top: 250px !important">Adds value, and encourages innovation</p>
<p class="fragment box-right fade-in-then-out" style="background-color:var(--astun-teal);margin-top: 250px !important">and helps others understand the data</p>
<p class="fragment box-right fade-in" style="background-color:var(--astun-teal);margin-top: 250px !important"><a target="_blank" href="https://coronavirus.data.gov.uk/search?postcode=LA1+4UR">Serious example</a></p>
<p class="attribution" style="color:grey">The Internet</p>
<aside class="notes">During COVID we all got pretty good at deciding whether case data was useful or not. For this sort of dataset you need to know that it's being updated daily. Metadata reduces the time people spend searching for datasets, and reduces the likelihood of their choosing the wrong dataset for the task at hand. A good catalogue will also be optimised for discovery by both humans and search engines, and will include user statistics and feedback options, helping data providers know which datasets are being used and giving people an easy way to report problems.</aside>
</section>
<section data-background-image="./images/facepalm.jpg" alt="facepalm" class="full" data-background-size="cover">
<h3 class="box" style="background-color:var(--astun-green)">However...</h3>
<img class="fragment box" style="background-color:var(--astun-green);margin-top: 50px !important" src="./images/chatgpt.png"/>
<p style="color:white" class="attribution">Alex Proimos, Flickr CC BY-NC 2.0</p>
<aside class="notes">As ChatGPT said, metadata is complex, difficult to organise, with many different standards, formats and terminology. Hopefully we can make some of this easier to understand in the rest of this talk!</aside>
</section>
<section data-background-image="./images/alien.jpg" alt="alien" class="full" data-background-size="cover">
<h2 class="box" style="background-color:var(--astun-red);margin-top: 250px !important">Some jargon</h2>
<p class="attribution" style="color:white">The Internet</p>
<aside class="notes">In this section, we'll cover some of the most common acronyms and terms that you might hear when working with metadata, hopefully with a simple explanation</aside>
</section>
<section data-background-image="./images/xml-tshirts.jpg" alt="xml t-shirt" class="full" data-background-size="contain">
<h3 class="box-left" style="background-color:var(--astun-teal);margin-top: 250px !important">e<strong style="color:var(--astun-red)">X</strong>tensible <strong style="color:var(--astun-red)">M</strong>arkup <strong style="color:var(--astun-red)">L</strong>anguage</h3>
<p class="fragment box-right fade-in-then-out" style="background-color:var(--astun-teal);margin-top: 250px !important">The most common format in which metadata is provided</p>
<p class="fragment box-right fade-in-then-out" style="background-color:var(--astun-teal);margin-top: 250px !important">Both a language and a file format</p>
<p class="fragment box-right fade-in-then-out" style="background-color:var(--astun-teal);margin-top: 250px !important">Takes the form of <strong style="color:var(--astun-red)">elements</strong>, surrounded by <strong style="color:var(--astun-red)">start</strong> and <strong style="color:var(--astun-red)">end tags</strong> that describe what the element is and what type of thing it is</p>
<p class="fragment box-right fade-in" style="background-color:var(--astun-teal);margin-top: 250px !important"><a target="_blank" href="https://spatialdata.gov.scot/geonetwork/srv/api/records/55587e7d-c9ef-46bd-8670-1452d3666f5c/formatters/xml?approved=true">XML metadata</a></p>
<p class="attribution" style="color:grey">Redbubble</p>
<aside class="notes">Metadata is often supplied in XML because the format is flexible and extensible. It's both machine-readable and (with practice) human-readable. A good metadata catalogue will shield you from most of the complexity.</aside>
</section>
<section data-background-image="./images/xml-tshirts2.jpg" alt="xml t-shirt" class="full" data-background-size="contain">
<h3 class="box-left" style="background-color:var(--astun-teal);margin-top: 350px !important">XML rules and validity</h3>
<p class="fragment box-right fade-in-then-out" style="background-color:var(--astun-teal);margin-top: 350px !important">Rules on what should be in a metadata record are defined by <strong style="color:var(--astun-red)">schemas</strong></p>
<p class="fragment box-right fade-in-then-out" style="background-color:var(--astun-teal);margin-top: 350px !important">A record can re-use elements from many different schemas using <strong style="color:var(--astun-red)">namespaces</strong></p>
<p class="fragment box-right fade-in-then-out" style="background-color:var(--astun-teal);margin-top: 350px !important">A valid record must be both <strong style="color:var(--astun-red)">well-formed</strong> and <strong style="color:var(--astun-red)">schema-valid</strong></p>
<p class="fragment box-right fade-in" style="background-color:var(--astun-teal);margin-top: 350px !important"><a target="_blank" href="https://spatialdata.gov.scot/geonetwork/srv/api/records/55587e7d-c9ef-46bd-8670-1452d3666f5c/formatters/xml?approved=true">That record again</a></p>
<p class="attribution" style="color:grey">Redbubble</p>
<aside class="notes">Schemas are basically frameworks for what should be included when creating metadata. To avoid re-inventing the wheel, schemas reuse elements from other schemas to describe certain concepts. The different schemas are referred to in the XML using namespaces. Valid metadata must be both well-formed (so no dangling tags, for instance) and schema-valid. There are a number of related technologies, such as XSL(T) for transforming XML, XPath for referring to a particular part of an XML document, and XQuery for running SQL-like queries. These are used extensively within metadata catalogues such as GeoNetwork to control the display of records on the page, and to allow updates to records.</aside>
</section>
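<section data-markdown>
<textarea data-template>
A minimal sketch of the well-formedness half of the validity check, using Python's standard `xml.etree.ElementTree`. The `is_well_formed` helper, the `ex` namespace and the sample records are all hypothetical. Schema validation is not attempted here, as it needs a schema-aware validator.

```python
import xml.etree.ElementTree as ET


def is_well_formed(xml_text: str) -> bool:
    """Return True if xml_text parses as XML (tags are closed and correctly nested)."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


# Hypothetical records: the namespace URI and element names are made up for illustration.
good = '<ex:record xmlns:ex="http://example.com/ns"><ex:title>Rivers</ex:title></ex:record>'
bad = '<ex:record xmlns:ex="http://example.com/ns"><ex:title>Rivers</ex:record>'

print(is_well_formed(good))  # the title element is closed before the record
print(is_well_formed(bad))   # the title element is never closed

# The parser expands the "ex" prefix into the full namespace URI.
root = ET.fromstring(good)
print(root.tag)
```

A record that passes this check can still fail schema validation, for example by missing a mandatory element.
</textarea>
</section>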
<section data-background-image="./images/bored_dog.png" alt="dall-e bored dog" class="full" data-background-size="contain">
<h3 class="box" style="background-color:var(--astun-teal)">Schemas, Standards and Profiles</h3>
<p class="fragment box-right fade-in-then-out" style="background-color:var(--astun-teal);margin-top: 250px !important"><strong style="color:var(--astun-red)">Schema:</strong> specifies the structure of the metadata, the elements, and the controlled vocabularies used</p>
<p class="fragment box-right fade-in-then-out" style="background-color:var(--astun-teal);margin-top: 250px !important"><strong style="color:var(--astun-red)">Standard:</strong> a schema that has been developed and maintained by a standards organisation</p>
<p class="fragment box-right fade-in-then-out" style="background-color:var(--astun-teal);margin-top: 250px !important"><strong style="color:var(--astun-red)">Profile:</strong> an extension to a standard, or an implementation of it in software, or both</p>
<img class="fragment box-left fade-in-then-out" style="background-color:var(--astun-teal);margin-top: 25px !important;height:400px; width:400px;" src="./images/iso19139.png">
<aside class="notes">You'll hear all of these used interchangeably when people talk about metadata. Generally speaking, a profile can only be more strict than the standard it extends. This might be by making an element mandatory when the core standard says it's optional, or by adding a controlled vocabulary. Elements from other standards can be added as long as they don't conflict.</aside>
</section>
<section data-background-image="./images/sleeping-dog.png" alt="dall-e bored dog" class="full" data-background-size="contain">
<h3 class="box" style="background-color:var(--astun-teal)">ISO Standards and Gemini</h3>
<span class="fragment box-right fade-in-then-out twocolumn" style="background-color:var(--astun-teal);width:60%">
<p>The core standard for Geospatial metadata is <strong style="color:var(--astun-red)">ISO19115</strong></p>
<img src="./images/bsi_iso19115.png"/></span>
<span class="fragment box-right fade-in-then-out twocolumn" style="background-color:var(--astun-teal);width:60%">
<p><strong style="color:var(--astun-red)">ISO19139</strong> is the implementation of ISO19115 in XML</p>
<img src="./images/bsi_iso19139.png"/></span>
<span class="fragment box-right fade-in-then-out twocolumn" style="background-color:var(--astun-teal);width:60%">
<p><strong style="color:var(--astun-red)">GEMINI</strong> is an extension to ISO19139 for UK geospatial metadata</p>
<img src="./images/agi_gemini.png"/></span>
<span class="fragment box-right fade-in" style="background-color:var(--astun-teal);margin-top: 250px !important"><p><a target="_blank" href="https://www.agi.org.uk/uk-gemini/">Gemini Online</a></p></span>
<aside class="notes">There are a couple of official ISO standards that you'll commonly encounter. Generally you can just focus on UK Gemini, where the implementation work has been done for you. It adds in some UK-specific terms and code lists, reduces some choices (such as language), and adds some elements from other standards such as ISO19119 for services</aside>
</section>
<section data-background-image="./images/Pineapple.jpg" alt="pineapple" class="full" data-background-size="cover">
<h3 class="box" style="background-color:var(--astun-green)">What about INSPIRE?</h3>
<p class="fragment box-left fade-in-then-out" style="background-color:var(--astun-green)">Yes, you still have to do it</p>
<p class="fragment box-left fade-in" style="background-color:var(--astun-green)">#1 Identify key environmental datasets</p>
<p class="fragment box-right fade-in" style="background-color:var(--astun-green)">#2 Provide metadata in Gemini format</p>
<p class="fragment box-left fade-in" style="background-color:var(--astun-green);margin-top: 250px !important">#3 Make data available via web services</p>
<p class="fragment box-right fade-in" style="background-color:var(--astun-green);margin-top: 250px !important">#4 Publish to data.gov.uk</p>
<p style="color:white" class="attribution">Unsplash CC0 1.0</p>
<aside class="notes">INSPIRE was an EU directive for the sharing of environmental spatial data, which came into force in 2007 and was brought over into UK law after Brexit. So yes, you still have to do it, but it's easy: a valid Gemini record is also INSPIRE-compliant, as long as it includes a keyword from the INSPIRE spatial data themes.</aside>
</section>
<section data-background-image="./images/intersection.png" alt="venn diagram" class="full" data-background-size="1000px 750px" data-background-position="bottom">
<h3 class="box" style="background-color:var(--astun-teal)">The intersection with Linked Data</h3>
<p class="fragment box-left fade-in" style="background-color:var(--astun-teal);width: 250px !important">URIs</p>
<p class="fragment box-left fade-in" style="background-color:var(--astun-teal);width: 250px !important;margin-top: 250px !important;">Dublin Core/DCAT</p>
<p class="fragment box-right fade-in" style="background-color:var(--astun-teal);width: 250px !important">RDF/Triples</p>
<p class="fragment box-right fade-in" style="background-color:var(--astun-teal);margin-top: 250px !important;width: 250px !important">Ontologies</p>
<aside class="notes">If metadata is data about data, linked data is data structured so that it can be easily connected and integrated with other data. Linked data uses standard web technologies such as URIs (Uniform Resource Identifiers, of which a web address or URL is a subset) to identify data and make it available in a machine-readable way. Other terms that you might come across at the intersection of metadata and linked data include the following. Dublin Core is a basic, domain-agnostic standard for describing any sort of resource. DCAT is a vocabulary for describing data catalogues that reuses Dublin Core terms, and DCAT-AP and GeoDCAT-AP are profiles of DCAT, widely used in catalogues such as CKAN. RDF is a data model built from "triple statements", each comprising a subject, a predicate or relationship, and an object, and is widely used for linked data and semantic web implementations. The Dublin in Dublin Core relates to Dublin, Ohio rather than Dublin, Ireland, and that is a triple statement where the subject is "Dublin Core", the predicate is "relates to", and the object is "Dublin, Ohio". An ontology is a fancy word for a set of categories or concepts within a domain. A good metadata catalogue will allow you to publish and ingest metadata in both ISO19139 and DCAT formats.</aside>
</section>
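<section data-markdown>
<textarea data-template>
A minimal sketch of the "triple statement" idea, assuming plain Python strings stand in for the URIs that real RDF would use. The `Triple` class and the `objects_of` helper are hypothetical names for illustration, not part of any RDF library.

```python
from typing import List, NamedTuple


class Triple(NamedTuple):
    """One statement: a subject, a predicate (relationship) and an object."""
    subject: str
    predicate: str
    obj: str


# Real RDF identifies subjects and predicates with URIs; strings keep the sketch readable.
triples = [
    Triple("Dublin Core", "relates to", "Dublin, Ohio"),
    Triple("DCAT", "reuses terms from", "Dublin Core"),
]


def objects_of(subject: str, predicate: str, store: List[Triple]) -> List[str]:
    """Return every object linked to the subject by the given predicate."""
    return [t.obj for t in store if t.subject == subject and t.predicate == predicate]


print(objects_of("Dublin Core", "relates to", triples))
```

Because every statement has the same three-part shape, statements from different sources can be merged into one list and queried together.
</textarea>
</section>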
<section data-background-image="./images/swiss-army-knife.jpg" alt="venn diagram" class="full" data-background-size="cover">
<h3 class="box" style="background-color:var(--astun-red)">Are metadata catalogues just for spatial data?</h3>
<p class="fragment box-left fade-in" style="background-color:var(--astun-red)">No! Other uses include...</p>
<p class="fragment box-right fade-in" style="background-color:var(--astun-red)">Non-spatial datasets...</p>
<p class="fragment box-left fade-in" style="background-color:var(--astun-red);margin-top: 250px !important">Detailed workflow recording...</p>
<p class="fragment box-right fade-in" style="background-color:var(--astun-red);margin-top: 250px !important">GDPR records...</p>
<p class="attribution" style="color:white">James Case, Flickr CC0 1.0</p>
<aside class="notes">The core ISO19139 standard allows the use of elements from many other standards, so you can extend records to store additional information to meet your needs, in a structured, standards-compliant, interoperable way.</aside>
</section>
<section data-background-image="./images/funfair.jpg" alt="funfair queue" class="full" data-background-size="cover">
<h3 class="box" style="background-color:var(--astun-teal)">What is Q-FAIR all about?</h3>
<p class="fragment box-left fade-in" style="background-color:var(--astun-teal)"><strong style="color:var(--astun-red)">Q</strong>uality</p>
<p class="fragment box-right fade-in" style="background-color:var(--astun-teal)"><strong style="color:var(--astun-red)">-</strong> is just a hyphen</p>
<p class="fragment box-left fade-in" style="background-color:var(--astun-teal);margin-top: 125px !important"><strong style="color:var(--astun-red)">F</strong>indable</p>
<p class="fragment box-right fade-in" style="background-color:var(--astun-teal);margin-top: 125px !important"><strong style="color:var(--astun-red)">A</strong>ccessible</p>
<p class="fragment box-left fade-in" style="background-color:var(--astun-teal);margin-top: 250px !important"><strong style="color:var(--astun-red)">I</strong>nteroperable</p>
<p class="fragment box-right fade-in" style="background-color:var(--astun-teal);margin-top: 250px !important"><strong style="color:var(--astun-red)">R</strong>eusable</p>
<p class="attribution" style="color:white">Copyright Eoin Sharkey</p>
<aside class="notes">Q-FAIR is all about making your data high-quality, findable, accessible, interoperable and reusable. This is best (or most easily) achieved if your metadata is also Q-FAIR, and also best achieved using open standards and formats where possible.</aside>
</section>
<section data-background-image="./images/kitten-hiding.jpg" alt="kitten hiding" class="full" data-background-size="cover">
<h2 class="box" style="background-color:var(--astun-teal)">Where to start?</h2>
<p class="fragment box-left fade-in" style="color:white;background-color:var(--astun-teal)">Get as much buy-in as possible</p>
<p class="fragment box-left fade-in" style="color:white;background-color:var(--astun-teal);margin-top: 250px !important">Tools such as Data Maturity Models help assess your current level</p>
<p class="fragment box-right fade-in" style="color:white;background-color:var(--astun-teal)">Frame it in terms of getting the best value out of your assets</p>
<p class="fragment box-right fade-in" style="color:white;background-color:var(--astun-teal);margin-top: 250px !important">But take it one step at a time!</p>
<p style="color:grey" class="attribution">Unsplash CC0 1.0</p>
<aside class="notes">After the jargon, here are some simple tips for how to get started. Go back to the benefits of data sharing to get as much buy-in as possible, and use tools such as Data Maturity Models to identify where you should focus time and energy. It can be done a bit at a time though, and here are some simple ways to get started</aside>
</section>
<section data-background-image="./images/keep-it-simple.jpg" alt="simple lego" class="full" data-background-size="cover">
<h3 class="box" style="background-color:var(--astun-teal)">Novice Level</h3>
<p class="fragment box-left fade-in" style="color:white;background-color:var(--astun-teal)">Keep it simple</p>
<p class="fragment box-right fade-in" style="color:white;background-color:var(--astun-teal)">Stick to a structure</p>
<p class="fragment box-left fade-in" style="color:white;background-color:var(--astun-teal);margin-top: 250px !important">Metadata in a spreadsheet is better than no metadata</p>
<p class="fragment box-right fade-in" style="color:white;background-color:var(--astun-teal);margin-top: 250px !important">Choose a few key datasets to start with</p>
<p class="attribution" style="color:white">elPadawan, Flickr CC BY-SA 2.0</p>
<aside class="notes">Recording a small number of key metadata elements for your datasets in a simple spreadsheet is a good way to start. It's always possible to script the insertion of them into a catalogue if you choose to</aside>
</section>
<section data-background-image="./images/lego-ninja.jpg" alt="lego ninja" class="full" data-background-size="cover">
<h3 class="box" style="background-color:var(--astun-green);margin-top: 50px !important">Next Level</h3>
<p class="fragment box-left fade-in" style="color:white;background-color:var(--astun-green)">Don't reinvent the wheel</p>
<p class="fragment box-right fade-in" style="color:white;background-color:var(--astun-green)">Adopt open solutions that aid sharing and discovery</p>
<p class="fragment box-left fade-in" style="color:white;background-color:var(--astun-green);margin-top: 250px !important">Always be Q-FAIR</p>
<p class="fragment box-right fade-in" style="color:white;background-color:var(--astun-green);margin-top: 250px !important">Don't let perfect be the enemy of good</p>
<p class="attribution" style="color:white">wiredforlego, Flickr CC BY-SA 2.0</p>
<aside class="notes">There are many metadata solutions out there, so you don't need to craft your own. To make your life, and the lives of people using your data, easier, choose a solution that adopts open standards and promotes a Q-FAIR approach. Don't worry if your metadata or data are not perfect though: getting something out there is much better than nothing!</aside>
</section>
<section data-background-image="./images/helpsign.jpg" alt="help" class="full" data-background-size="cover">
<h3 class="box" style="background-color:var(--astun-orange);margin-top: 300px !important">Whatever stage you're at...</h3>
<p class="fragment box" style="color:white;background-color:var(--astun-orange);margin-top: 50px !important">We can help! <a href="https://www.astuntechnology.com">astuntechnology.com</a></p>
<p class="attribution" style="color:white">istockphoto</p>
</section>
<section data-background-image="./images/handsup.jpg" alt="help" class="full" data-background-size="cover">
<h3 class="box" style="background-color:var(--astun-teal)">Any questions?</h3>
<p class="attribution" style="color:white">U.S. Embassy and Consulates in Canada, Flickr Public Domain Mark 1.0</p>
</section>
<section data-background-image="./images/fly.jpg" alt="flying dog" class="full" data-background-size="cover">
<h3 class="box" style="background-color:var(--astun-green);margin-top: 350px !important">Thanks for attending!</h3>
<p class="fragment box" style="background-color:var(--astun-green);margin-top: 50px !important">We'll be sharing the recording with you all shortly</p>
<p class="attribution" style="color:white">US Air Force</p>
</section>
</div>
<div class="fe-persistent-header">
<img src="./images/AstunSmallLogo.png"/>
<p style="color: var(--astun-red);font-weight:600">A Beginner's Guide to Metadata</p>
</div>
</div>
<script src="dist/reveal.js"></script>
<script src="plugin/zoom/zoom.js"></script>
<script src="plugin/notes/notes.js"></script>
<script src="plugin/search/search.js"></script>
<script src="plugin/markdown/markdown.js"></script>
<script src="plugin/highlight/highlight.js"></script>
<script>
// Also available as an ES module, see:
// https://revealjs.com/initialization/
Reveal.initialize({
controls: true,
progress: true,
center: true,
hash: true,
pdfMaxPagesPerSlide: 1,
pdfSeparateFragments: true,
slideNumber: "c/t",
// Learn about plugins: https://revealjs.com/plugins/
plugins: [ RevealZoom, RevealNotes, RevealSearch, RevealMarkdown, RevealHighlight ]
});
</script>
</body>
</html>