Monday, 19th March 2012
The Value and Benefits of Text Mining
Source: JISC (UK)
From the Executive Summary:
Vast amounts of new information and data are generated everyday through economic, academic and social activities. This sea of data, predicted to increase at a rate of 40% p.a., has significant potential economic and societal value. Techniques such as text and data mining and analytics are required to exploit this potential.
Businesses use such techniques to analyse customer and competitor data to improve competitiveness; the pharmaceutical industry mines patents and research articles to improve drug discovery; within academic research, mining and analytics of large datasets are delivering efficiencies and new knowledge in areas as diverse as biological science, particle physics and media and communications.
The global research community generates over 1.5 million new scholarly articles per annum. As the recent Hargreaves report into ‘Digital Opportunity: A Review of Intellectual Property and Growth’ highlighted, text mining and analytics of this scholarly literature and other digitised text affords a real opportunity to support innovation and the development of new knowledge. However, current UK copyright laws are restricting this use of text mining. To remedy this, Hargreaves proposes an exception to support text mining and analytics for non-commercial research.
In order to be ‘mined’, text must be accessed, copied, analysed, annotated and related to existing information and understanding. Even if the user has access rights to the material, making annotated copies can be illegal under current copyright law without the permission of the copyright holder.
+ Direct link to Report (PDF; 6.8 MB)
By Adrian Janes
Having begun his career in academic libraries, Adrian Janes is currently an Information Services Librarian with the London Borough of Havering.
In this role, he has particular responsibility for information from both the UK Government and the European Union. He wrote a detailed report on sources for the latter which was published by Free Pint Ltd. in 2007. He is also involved in training and publicising online reference resources and is a regular contributor to DocuTicker.
Adrian can be reached at firstname.lastname@example.org
More articles by Adrian Janes »
Please note: DocuTicker's editors collect citations for full-text PDF reports freely available on the web but we do not archive these reports. When you click a link to find and/or download the report, you are leaving the DocuTicker site. DocuTicker makes no representations regarding the ongoing availability of any report or any external resource. Links were accurate as of the date of posting.