Brown, J. (2010)
MERLIN: Metadata Enrichment for Repositories in a London Institutional Network.
Presented at: JISC Automatic Metadata Generation and Text Mining Projects Meeting, America Square Conference Centre, London.
MS PowerPoint: 19920.ppt (1MB)
Abstract
MERLIN will use off-the-shelf text mining techniques to enrich the functionality of the SHERPA-LEAP consortial repository cross-searching service, LASSO. LASSO offers search across aggregated, normalised metadata collected from London-based institutional repositories using OAI-PMH harvesting. MERLIN will use the TerMine term extraction tool to derive terms from the full-text digital objects held at LASSO's source repositories and, after a weighting process, enrich the LASSO database with the derived keywords. The derived terms will be exposed at various points in the LASSO interface to support discovery. In a supplementary strand, MERLIN will apply thesaurus tools to construct a pilot hierarchical, browsable subject tree from the text-mined keywords. An open source, re-usable web application will be created to allow the MERLIN metadata enrichment technology to be incorporated into any repository on any platform.
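The abstract does not say how the weighting step works, so the following is only a minimal sketch of one plausible approach, not the MERLIN method. It assumes candidate terms have already been extracted for each record (for example by a tool such as TerMine) and ranks them with a smoothed TF-IDF score. The function name `derive_keywords`, the `top_k` parameter and the sample record identifiers are all hypothetical.

```python
import math
from collections import Counter


def derive_keywords(docs_terms, top_k=3):
    """Pick the top_k highest-weighted terms for each record.

    docs_terms maps a record id to the list of candidate terms extracted
    from that record's full text (repeats allowed). The TF-IDF weighting
    used here is an assumption; the source does not specify the scheme.
    """
    n_docs = len(docs_terms)

    # Document frequency: in how many records does each term occur?
    doc_freq = Counter()
    for terms in docs_terms.values():
        doc_freq.update(set(terms))

    keywords = {}
    for rec_id, terms in docs_terms.items():
        counts = Counter(terms)
        # Smoothed TF-IDF: terms found in every record score zero.
        scores = {
            term: count * math.log((1 + n_docs) / (1 + doc_freq[term]))
            for term, count in counts.items()
        }
        # Sort by descending score, breaking ties alphabetically.
        ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
        keywords[rec_id] = [term for term, _ in ranked[:top_k]]
    return keywords


# Hypothetical input: two records with already-extracted candidate terms.
sample = {
    "rec-a": ["text mining", "text mining", "repository"],
    "rec-b": ["repository", "metadata"],
}
print(derive_keywords(sample, top_k=1))
```

In this sketch a term shared by every record, such as "repository" in the sample, is weighted down to zero, so the keywords kept for each record are the ones that distinguish it from the rest of the aggregated collection.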
Type: | Conference item (Presentation) |
---|---|
Title: | MERLIN: Metadata Enrichment for Repositories in a London Institutional Network |
Event: | JISC Automatic Metadata Generation and Text Mining Projects Meeting |
Location: | America Square Conference Centre, London |
Dates: | 25 May 2010 |
Open access status: | An open access version is available from UCL Discovery |
Language: | English |
UCL classification: | UCL > Provost and Vice Provost Offices > VP: Research > Library Services |
URI: | https://discovery-pp.ucl.ac.uk/id/eprint/19920 |