<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>gnTEAM &#187; Search Results  &#187;  &#8220;Data Mining&#8221;</title>
	<atom:link href="http://gnteam.cs.manchester.ac.uk/search/%22Data+Mining%22/feed/rss2" rel="self" type="application/rss+xml" />
	<link>http://gnteam.cs.manchester.ac.uk</link>
	<description>Text extraction, analytics, mining</description>
	<lastBuildDate>Fri, 05 Mar 2021 17:44:55 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.2.37</generator>
	<item>
		<title>Temporal expression extraction with extensive feature type selection and a posteriori label adjustment</title>
		<link>http://gnteam.cs.manchester.ac.uk/publication/297998-temporal/</link>
		<comments>http://gnteam.cs.manchester.ac.uk/publication/297998-temporal/#comments</comments>
		<pubDate>Mon, 07 Mar 2016 11:11:15 +0000</pubDate>
		<dc:creator><![CDATA[mbelousov]]></dc:creator>
		
		<guid isPermaLink="false">http://gnteam.cs.manchester.ac.uk/?post_type=publication&#038;p=1642</guid>
		<description><![CDATA[<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/publication/297998-temporal/">Temporal expression extraction with extensive feature type selection and a posteriori label adjustment</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/publication/297998-temporal/">Temporal expression extraction with extensive feature type selection and a posteriori label adjustment</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gnteam.cs.manchester.ac.uk/publication/297998-temporal/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Linked2Safety &#8211; a next-generation, secure linked-data medical information space for semantically-interconnecting electronic health records and clinical trials systems</title>
		<link>http://gnteam.cs.manchester.ac.uk/project/linked2safety/</link>
		<comments>http://gnteam.cs.manchester.ac.uk/project/linked2safety/#comments</comments>
		<pubDate>Thu, 02 Jul 2015 10:29:26 +0000</pubDate>
		<dc:creator><![CDATA[admin]]></dc:creator>
		
		<guid isPermaLink="false">http://gnode.dev/?post_type=project&#038;p=1546</guid>
		<description><![CDATA[<p>The main aim of the Linked2Safety project is to explore the Semantic Web and Linked Data to facilitate semantic interlinking of electronic health records (EHRs) and clinical trials systems for gathering and sharing knowledge to support decision making in medical and clinical research. The vision is to facilitate early detection&#8230; </p>
<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/project/linked2safety/">Linked2Safety &#8211; a next-generation, secure linked-data medical information space for semantically-interconnecting electronic health records and clinical trials systems</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>The main aim of the Linked2Safety project is to explore the Semantic Web and Linked Data to facilitate semantic interlinking of electronic health records (EHRs) and clinical trials systems for gathering and sharing knowledge to support decision making in medical and clinical research. The vision is to facilitate early detection of patients&#8217; safety issues, the identification of adverse events and the identification of a suitable critical mass of patients to participate in small (Phases II and III) or larger scale (Phase IV) clinical trials.</p>
<p>Our role is focused on the design of an interoperable EHR data space and development of bio-marker data mining techniques for adverse events early detection. We will also provide several clinical trials showcases and organise the <a href="http://www.linked2safety-project.eu/sig">Clinical research and patients safety Special Interest Group</a>.</p>
<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/project/linked2safety/">Linked2Safety &#8211; a next-generation, secure linked-data medical information space for semantically-interconnecting electronic health records and clinical trials systems</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gnteam.cs.manchester.ac.uk/project/linked2safety/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Integration of text and data mining in life sciences</title>
		<link>http://gnteam.cs.manchester.ac.uk/project/text-and-data-mining-in-life-sciences/</link>
		<comments>http://gnteam.cs.manchester.ac.uk/project/text-and-data-mining-in-life-sciences/#comments</comments>
		<pubDate>Fri, 26 Jun 2015 13:57:52 +0000</pubDate>
		<dc:creator><![CDATA[admin]]></dc:creator>
		
		<guid isPermaLink="false">http://gnode.dev/?post_type=project&#038;p=271</guid>
		<description><![CDATA[<p>There have been numerous efforts to provide tools for storing, extracting and analysing data in life sciences. Interoperability and integration of such efforts is a challenging issue, not only technically (e.g. different formats, protocols, encodings) but also more importantly semantically. We are involved in a number of community-driven initiatives to&#8230; </p>
<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/project/text-and-data-mining-in-life-sciences/">Integration of text and data mining in life sciences</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>There have been numerous efforts to provide tools for storing, extracting and analysing data in life sciences. Interoperability and integration of such efforts is a challenging issue, not only technically (e.g. different formats, protocols, encodings) but also more importantly semantically. We are involved in a number of community-driven initiatives to provide better integration for life science research.<br />
One initiative is to provide harmonised ways for representing and tagging named entities in the life science literature. We are proposing to establish common document formats that facilitate the exchange of annotation results contained in the literature as a complementary approach to the development of interoperable tools. We work towards (a) recommendations for a common syntax to embody entity mentions in publishers&#8217; document formats (e.g., into PMC), and (b) provision of a common way to reference semantic types. The initial results have been implemented in the IeXML proposal, which has already been used in some community-wide projects (e.g. CALBC). The original IeXML paper is available here.<br />
<strong>Involved</strong>: D. Rebholz (EBI), G. Nenadic<br />
Another initiative is to use ontologies and text mining to integrate and mark up data (both structured and unstructured) and provide semantics-based faceted browsing to help users navigate, query and retrieve data. The Ontogrator platform has been developed by the NERC Environmental Bioinformatics Centre and the University of Manchester, with a pilot implementation developed in collaboration with the Genomic Standards Consortium (GSC) that includes integrated content from the StrainInfo, GOLD, CAMERA, Silva and Pubmed databases.<br />
<strong>Involved</strong>: D. Field (NEBC), N. Morrison (Manchester), D. Hancock, L. Hirschman, G. Nenadic, et al.<br />
As part of the BBSRC-funded pubmed2ensembl project, we have developed a customised and extended version of the Ensembl BioMart by adding gene-related publication information, i.e. PubMed IDs and PubMed Central IDs, including URL link-outs and other information. The pubmed2ensembl BioMart has an enhanced interface that allows interactive full-text search queries to be carried out via NCBI&#8217;s Entrez Utilities (eUtils); the search results are then applied as an additional filter on the mart datasets. The system also provides DAS link-outs into the Ensembl Genome Browser, where a custom DAS track summarises the publication data that have been accumulated on a per-gene basis.<br />
<strong>Involved</strong>: J. Baran, C. Bergman, G. Nenadic, M. Gerner</p>
<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/project/text-and-data-mining-in-life-sciences/">Integration of text and data mining in life sciences</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gnteam.cs.manchester.ac.uk/project/text-and-data-mining-in-life-sciences/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Prof John Keane</title>
		<link>http://gnteam.cs.manchester.ac.uk/staff/jkeane/</link>
		<comments>http://gnteam.cs.manchester.ac.uk/staff/jkeane/#comments</comments>
		<pubDate>Thu, 25 Jun 2015 16:12:56 +0000</pubDate>
		<dc:creator><![CDATA[admin]]></dc:creator>
		
		<guid isPermaLink="false">http://gnode.dev/?post_type=staff&#038;p=237</guid>
		<description><![CDATA[<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/staff/jkeane/">Prof John Keane</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/staff/jkeane/">Prof John Keane</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gnteam.cs.manchester.ac.uk/staff/jkeane/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Information for prospective postgraduate students</title>
		<link>http://gnteam.cs.manchester.ac.uk/contact/prospective-postgraduates/</link>
		<comments>http://gnteam.cs.manchester.ac.uk/contact/prospective-postgraduates/#comments</comments>
		<pubDate>Mon, 22 Jun 2015 12:21:04 +0000</pubDate>
		<dc:creator><![CDATA[admin]]></dc:creator>
		
		<guid isPermaLink="false">http://gnode.dev/?page_id=62</guid>
		<description><![CDATA[<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/contact/prospective-postgraduates/">Information for prospective postgraduate students</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></description>
				<content:encoded><![CDATA[<div class="osc-res-tab tabbable   osc-tabs-left"><div style="clear:both;width: 100%;"><ul class="nav osc-res-nav nav-pills osc-tabs-left-ul" id="oscitas-restabs-1-prospective-postgraduates-51889"><li class="active"><a href="./#general-information" data-toggle="tab">General information</a></li><li class=""><a href="./#themes" data-toggle="tab">Themes</a></li><li class=""><a href="./#application-steps" data-toggle="tab">Application steps</a></li><li class=""><a href="./#funding" data-toggle="tab">Funding</a></li><li class=""><a href="./#environment" data-toggle="tab">Environment</a></li></ul></div><div style="clear:both;width: 100%;"><ul class="tab-content" id="oscitas-restabcontent-1-prospective-postgraduates-51889"><li class="tab-pane active" id="general-information"></p>
<h3>General information</h3>
<p>We are always keen to have postgraduate research students in various areas of text mining and natural language processing. As a rule of thumb, you will need to have an excellent first degree in computer science or a related area (e.g. computational linguistics, mathematics, physics, bioinformatics), with very good programming experience and some experience in natural language processing (e.g. a final year project, a summer internship, an ad-hoc project). An MSc or publications in a related area will also be a distinct advantage.</p>
<p>The main theme of our research is <strong>feature engineering</strong> from unstructured documents written in natural languages. We investigate methodologies for the extraction of both explicit and implicit features from large collections of textual documents. Features can be terms, names, relations, co-occurrences, events, etc. Once engineered from text, the features can be used to provide understanding and reasoning over knowledge (e.g. by applying machine learning or data mining) &#8211; this discipline is referred to as text analytics, text mining or, more generally, natural language processing (NLP).</p>
<p></li><li class="tab-pane " id="themes"></p>
<h3>Themes</h3>
<p>Here are some core <strong>text mining themes</strong> (please see below for details) that are currently the focus in our TEAM:</p>
<ul>
<li><strong>Text analytics and sentiment analysis</strong>: identification of subjective opinion and sentiment features from user-generated content (e.g. blog mining, tweets, etc.);</li>
<li><strong>Extracting negations, contrasts and contradictions</strong>: identification of utterances that are negated, or contrast or contradict some other expressions (both explicit and implicit);</li>
<li><strong>Concept mining and structuring</strong>: learning and identification of concepts and terminology from text, including their structuring (internal and external);</li>
<li><strong>Temporal text analytics</strong>: identification of temporal expressions and their scope in text;</li>
<li><strong>Integrated text and data mining</strong>: combining the results from different perspectives using various methods from machine learning;</li>
<li><strong>Text processing middleware for the Semantic Web</strong>: building an infrastructure to support text mining solutions for the Semantic Web (identification of concepts, links, etc.);</li>
</ul>
<p>and these are preferred <strong>application areas</strong>:</p>
<ul>
<li>Biology and biomedicine (molecular interactions, cancer studies, characterisation of molecular events, etc.)</li>
<li>Bioinformatics and computational biology (tools, services, resources, methods)</li>
<li>Clinical medicine and health-care (clinical decision support, quality of life monitoring)</li>
<li>E-science, e-commerce and e-government (e.g. monitoring, tracking, dissemination of information)</li>
<li>Engineering (knowledge management)</li>
</ul>
<p>You would typically &#8216;select&#8217; a topic that consists of a particular theme in a specific application area. I&#8217;d also be happy to consider proposals in the areas of <strong>multi-lingual text mining</strong> and <strong>NLP for Serbian</strong>.</p>
<p></li><li class="tab-pane " id="application-steps"></p>
<h3>Application steps</h3>
<p>You will be expected to have a passion for text processing, in addition to an excellent first degree in computer science or a related area. Some experience in natural language processing is very useful, whereas very good programming experience (in a combination of programming languages) is a must. If you believe you&#8217;ve got all these, send an email to Goran Nenadic (see below) with a full CV and a brief note on why you would like to do a PhD in our TEAM. Please allow some time for us to reply. Contact email: <a href="mailto:G.Nenadic@manchester.ac.uk">G.Nenadic@manchester.ac.uk</a>.</p>
<p></li><li class="tab-pane " id="funding"></p>
<h3>Funding</h3>
<p>PhD studies last between 3 and 4 years, typically closer to 4 than to 3. There is only one route to securing funding: the candidate needs to be outstanding. There are 3 possible sources of funding:</p>
<ul>
<li>specific, pre-defined projects (NONE CURRENTLY),</li>
<li>funding from the School of Computer Science (see <a href="http://cdt.cs.manchester.ac.uk/" target="_blank">here</a> for details) and</li>
<li>external funding (private, external bodies &#8211; e.g. foreign governments, etc).</li>
</ul>
<p></li><li class="tab-pane " id="environment"></p>
<h3>Environment</h3>
<p>The School of Computer Science is one of the leading Schools in the UK, renowned for the excellence of its research. The world&#8217;s first computer with internal memory was built in the School, and Alan Turing laid the foundations of Computer Science and Artificial Intelligence while in Manchester. The international reputation of our research is reflected in its high ranking in the last national Research Assessment Exercise (RAE), which places the School among the best five Computer Science departments in the UK and top in England for research power. The School has a vibrant research environment with more than 150 PhD students, 90 research staff and 70 academic staff.</p>
<p>Our research <a href="http://gnode.dev/people/">TEAM</a> is part of the Text Mining/NLP research group, which hosts the UK National Centre for Text Mining. We are also affiliated to <a href="http://www.mib.ac.uk/" target="_blank">the Manchester Interdisciplinary BioCentre</a>. The team is vibrant, diverse and very much international.</p>
<p></li></ul></div></div>
<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/contact/prospective-postgraduates/">Information for prospective postgraduate students</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gnteam.cs.manchester.ac.uk/contact/prospective-postgraduates/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Training</title>
		<link>http://gnteam.cs.manchester.ac.uk/training/</link>
		<comments>http://gnteam.cs.manchester.ac.uk/training/#comments</comments>
		<pubDate>Mon, 22 Jun 2015 12:04:38 +0000</pubDate>
		<dc:creator><![CDATA[admin]]></dc:creator>
		
		<guid isPermaLink="false">http://gnode.dev/?page_id=28</guid>
		<description><![CDATA[<p>gnTEAM provides training in topics related to text mining for undergraduate (BSc final year projects) and postgraduate students (MSc, MPhil, PhD and EngD projects). Final year undergraduate and MSc projects associated with the team are announced annually as part of the School of Computer Science taught programmes. The current research post-graduate&#8230; </p>
<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/training/">Training</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>gnTEAM provides training in topics related to text mining for undergraduate (BSc final year projects) and postgraduate students (MSc, MPhil, PhD and EngD projects).<br />
Final year undergraduate and MSc projects associated with the team are announced annually as part of the School of Computer Science taught programmes.</p>
<p>The current <strong>research post-graduate themes</strong> include:</p>
<ul>
<li>Integrated and Contrastive Text and Data Mining</li>
<li>Text Analytics and Blog/Forum Sentiment Analysis</li>
<li>Extracting negations, contrasts and contradictions from biomedical literature</li>
<li>Clinical text mining</li>
<li>Text mining in engineering</li>
</ul>
<p>More specific post-graduate information is available <a href="http://gnode.dev/contact/prospective-postgraduates/">here</a>. For PhD funding opportunities see <a href="http://www.cs.manchester.ac.uk/study/postgraduate-research/programmes/cdt/" target="_blank">CDT in Computer Science</a>.</p>
<h2>Selected completed student projects</h2>
<div class="table-responsive"><table  style="width:100%; "  class="easy-table easy-table-default " border="0">
<thead>
<tr><th >Student Name</th>
<th >Project Title</th>
<th >Year</th>
</tr>
</thead>
<tbody>
<tr><td >E. Hein</td>
<td > EDViC: a web application to visualise and explore epidemiological literature (BSc project)</td>
<td > 2013</td>
</tr>

<tr><td >T. Patel</td>
<td > Analysing Twitter Posts to Discover and Review New Software Tools (BSc project)</td>
<td > 2012</td>
</tr>

<tr><td >B. Dumitru</td>
<td > Mining twitter data to gather information about pharmaceutical drugs (BSc project)</td>
<td > 2012</td>
</tr>

<tr><td >I. Townend</td>
<td > Mapping of Clinical Data between Heterogeneous Terminologies and Classifications (MSc project)</td>
<td > 2011</td>
</tr>

<tr><td >S. Asif</td>
<td > An Analysis of Financial Blogs and Forums (MSc project)</td>
<td > 2010</td>
</tr>

<tr><td >A. Dehghan</td>
<td > A Rule-based Approach to External Context Extraction from Biomedical Literature: URL and Role Extraction (MSc project)</td>
<td > 2010</td>
</tr>

<tr><td >A. Tsoutsoumpi</td>
<td > A question answering system from FAQ pages (MSc project)</td>
<td > 2010</td>
</tr>

<tr><td >D. Yang</td>
<td > Extending Areca with Remote Backup Features (BSc project)</td>
<td > 2010</td>
</tr>

<tr><td >S. Latif</td>
<td >Automatic Summarisation As Pre-Processing For Document Clustering (PhD project)</td>
<td > 2010</td>
</tr>

<tr><td >M. Greenwood</td>
<td >Prioritising links for Topic-focused Web Crawling using Lexical and Terminological Profiling (MPhil project)</td>
<td > 2009</td>
</tr>

<tr><td >H. Afzal</td>
<td >A Literature-Based Framework for Semantic Descriptions of E-Science Resources (PhD project)</td>
<td > 2009</td>
</tr>
</tbody></table></div>
<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/training/">Training</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gnteam.cs.manchester.ac.uk/training/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>LINNAEUS: A species name identification system for biomedical literature</title>
		<link>http://gnteam.cs.manchester.ac.uk/publication/113903-linnaeus/</link>
		<comments>http://gnteam.cs.manchester.ac.uk/publication/113903-linnaeus/#comments</comments>
		<pubDate>Tue, 10 Nov 2015 11:46:18 +0000</pubDate>
		<dc:creator><![CDATA[admin]]></dc:creator>
		
		<guid isPermaLink="false">http://localhost/~mbelousov/wp/?post_type=publication&#038;p=1488</guid>
		<description><![CDATA[<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/publication/113903-linnaeus/">LINNAEUS: A species name identification system for biomedical literature</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>The post <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk/publication/113903-linnaeus/">LINNAEUS: A species name identification system for biomedical literature</a> appeared first on <a rel="nofollow" href="http://gnteam.cs.manchester.ac.uk">gnTEAM</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gnteam.cs.manchester.ac.uk/publication/113903-linnaeus/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
