-
Question:New For the knowledge representation, how can we use e-learning application?Knowledge Representation (KR) it's a good method to extract relevant information or document to the users' needs. We need this technique in the e-lea... [more]
-
Answer added to:2 Social Networks Text Mining
-
Answer added to:8 Is there any tool to get data such as author name, year of publication, topic and summary of a research paper?Have you ever tried Refviz?
-
Question:New What is currently the better algorithm for Topic Modelling?What libraries are available for Topic Modelling?
-
Answer added to:3 Conditional Random fields vs Maximum Entropy ModelsHi, , this is the link you requested
-
Answer added to:4 Do you have some hints about software and online courses (Latent semantic analysis)?Hi Diogenes The software is not free. UNED is selling a only one payment license for 150 Euros + VAT and includes some support. The software contain... [more]
-
Answer added to:3 Is clustering of sentences in a given document possible? How will the sentences be clustered?You can make clusters of sentence as you cluster document, first extract sentences from document then treat each sentence as a document , at last you... [more]
-
Answer added to:14 How useful are Topic Models in practice?Topic models such as LDA can be used for recommender systems.
-
Answer added to:5 What is the current, unstructured text summarizer algorithm?there is an interesting link I found authored by Kavitha Ganesan http://dl.acm.org/citation.cfm?id=2187954&bnc=1
-
Answer added to:6 Research report: Event based real time information filtering in short segment text streamsOk, thanks! I will actually follow the course! I found some Stanford lectures on the topic from the Social Data Lab but this is a way more interesting... [more]
-
Answer added to:4 Does anybody have any experience in textmining downloaded pdfs from EBSCO?Steven, thanks. I will look into this and will get back to you. In the meanwhile, here's a viz from the topic modeler:
-
Answer added to:6 What is, in your opininon, the best paper/tool for text extraction from web pages?Dear Nicola, I downloaded the survey (http://tinyurl.com/9e9za54) and I think it's quite exhaustive and complete. I'll take into account in the futur... [more]
-
Answer added to:9 Is it possible to write my Master Thesis under the supervision of the DMCM?Consider co-supervision, where a medical doctor supervises the need for a system to make recommendations to the medic, and a Computer Science or Softw... [more]
-
Answer added to:4 Is clause extraction from sentences possible?Is it?
-
Answer added to:2 Is there any work that already labeled email addresses or messages in the Enron dataset as spams?Thank you Andreas, I got part of the labels from TREC 2005 Spam collection. It contains 98000 labelled emails out of 0.5 M emails.
-
Answer added to:7 How can we evaluate the efficiency of a text summariser?I think that a good starting point is to give a look at Text Analysis Conference (TAC) in particular http://www.nist.gov/tac/2011/Summarization/Guided... [more]
-
Answer added to:6 Are there any (open source OR commercial) textmining software tools that can automatically extract causaility/Bayesian networks from texts?Thanks! But does that also allow to learn from text? If so, do you know of anybody who has done that (with papers we can take a look at? Anyways - thi... [more]
-
Answer added to:11 Farsi parsers?Hello, There is a Persian dependency treebank in which MST and Malt parser can be used for that. 29,982 sentences (80% train, 10% test, and 10% vali... [more]
-
Answer added to:4 How to detect loops?Well I accidentally posted the topic in the wrong place. I though I am in Signal and Image Processing topic. Anyways I got the good response. I need ... [more]
-
Answer added to:1 Can Relevance Vector Machines be used for large datasets?Have you looked into the Classic3 dataset http://www.dataminingresearch.com/index.php/2010/09/classic3-classic4-datasets/ ? There are 7095 processed ... [more]
-
Answer added to:9 What open source tools for text categorization do you use?Thanks
About Text Mining
Text mining, sometimes alternately referred to as text data mining, roughly equivalent to text analytics, refers to the process of deriving high-quality information from text. Typical text mining tasks include text categorization, text clustering, concept/entity extraction, production of granular taxonomies, sentiment analysis, document summarization, and entity relation modeling (i.e., learning relations between named entities).

