tcguy — 2013-04-29T15:12:00-04:00 — #1
We have about 500 web pages that we need to extract data from. Each page contains the following metadata, but with a different date.
<meta name="dcterms.issued" content="2005-12-23" />
We want to extract the date from each file so that we know when the page was issued.
Any help is appreciated.
john_betong — 2013-05-03T11:43:49-04:00 — #2
That was more tedious than I thought:
Where do I send my bill
webinsane — 2013-05-17T04:19:46-04:00 — #3
Nice one John
john_betong — 2013-05-23T12:49:31-04:00 — #4
Thank you I am pleased you like it.
The OP posed the question and never returned
I hope others find the code useful.
andysky — 2013-10-11T05:24:54-04:00 — #5
It was to me. I am a new forum user, I joined for this answer.
Please can you help me? I read that Dublin Core released its spec in the (achieved) attempt to conform with RDFa Lite.
Does anyone know whether automated systems already exist with the purpose of extracting Schema: metadata with the native attributes????
Otherwise it is better to drop the <meta@name> and move to @property.