Home > DocuBase > Article

« All DocuBase Articles

 

Follow DocuTicker on Twitter Bookmark and Share   Feed

Tuesday, 15th April 2014

Text and Data Mining

Source: Directorate-General for Research and Innovation (EU)

From Executive Summary:

Text and data mining (TDM) is an important technique for analysing and extracting new insights and knowledge from the exponentially increasing store of digital data (‘Big Data’). It is important to understand the extent to which the EU’s current legal framework encourages or obstructs this new form of research and to assess the scale of the economic issues at stake.

TDM is useful to researchers of all kinds, from historians to medical experts, and its methods are relevant to organisations throughout the public and private sectors. Because TDM research technology is not prohibitively expensive, it is readily available to lone entrepreneurs, individual post-graduate students, start-ups and small firms. It is also amenable to playful and highly speculative uses, enabling research connections between previously unconnected fields. There is growing recognition that we are at the threshold of the mass automation of service industries (automation of thinking) comparable with the robotic automation of manufacturing production lines (automation of muscle) in an earlier era. TDM will be widely used to provide insights in the re-design of this digital services economy.

When it comes to the deployment of TDM, there are worrying signs that European researchers may be falling behind, especially with regard to researchers in the United States. Researchers in Europe believe that this results, at least in part, from the nature of Europe’s laws with regard to copyright, database protection and, perhaps increasingly, data privacy. In the United States, the ‘fair use’ defence against copyright infringement appears to offer greater re-assurance to researchers than the comparable copyright framework in Europe, which relies upon a closed set of statutory exceptions. Recent court decisions, for example in the ten-year old ‘Google Books’ case, appear to confirm this. The US has no equivalent of Europe’s database protection law.

+ Direct link to document (PDF; 2 MB)


Category:

Source:

Views: 1713


Having begun his career in academic libraries, Adrian Janes has subsequently worked extensively in public libraries, chiefly in enquiry work as an Information Services librarian. In this role he has had particular responsibility for information from both the UK Government and the European Union. He wrote a detailed report on sources for the latter which was published by FreePint in 2007, and has contributed articles to FreePint and ResourceShelf. He is involved in training in information literacy and the use of online reference resources.

A Contributing Editor to DocuTicker, he also write reviews for Pennyblackmusic.

Adrian can be reached at adrian.janes@freepint.com

More articles by Adrian Janes »



Please note: DocuTicker's editors collect citations for full-text PDF reports freely available on the web but we do not archive these reports. When you click a link to find and/or download the report, you are leaving the DocuTicker site. DocuTicker makes no representations regarding the ongoing availability of any report or any external resource. Links were accurate as of the date of posting.

« All DocuBase Articles

 

FreePint

FreePint supports the value of information in the enterprise. Read more »


FeedLatest FreePint Content:


  • Click to view the article Making Your Intranet Unusable
    Friday, 24th October 2014

    James Mullan has written a number of FreePint articles about some of the tools and techniques you can use to make your intranet more effective. Now in what some people might think is a cruel twist of fate, he's decided to look at some of the ways in which you can make your intranet unusable.

  • Click to view the article Assessing the Credibility of Crowdsourced Content
    Friday, 24th October 2014

    Libby Trudell investigates the role of crowdsourcing in enterprise information. She defines the term, provides an overview of quality control and gives some examples of crowdsourced research services in areas from company data to intellectual property. Finally she identifies key questions to ask if you are incorporating such resources into your research portfolio.

  • Click to view the article InfoDesk Eases the Push of Relevant Real-Time Content
    Friday, 24th October 2014

    InfoDesk is a content integration and information management solution company with an emphasis on current awareness content delivery. The company defines itself as content neutral, meaning it will integrate any electronic content to provide organisations with a custom search and content delivery system that makes the most of their information assets.

  • ... more ...

All FreePint Content »
FreePint Topics »


A FreePint Subscription delivers articles and reports that support your organisation's information practice, content and strategy.

Find out more and order a FreePint Subscription by visiting the
completing our online form: Subscription Order page.


FreePint Testimonials

"Loved it. Very impressed. It was great. A very high calibre event. The people participating were of a very high calibre. Felt that the topics ..."

Read more testimonials and supply yours »







 

 
 
 

Subscribe

Receive the DocuTicker Newsletter each week.

Find out more »

Article Categories

All Article Categories »

Sources

All DocuBase Sources »

Source Categories

All Source Categories »

Archive

All Archives »