COURSE SYLLABUS Text Mining 7.5 credits Text Mining Second cycle, D7058E Version Autumn 2020 Sp 1 - Spring 2021 Sp 4Autumn 2021 Sp 1 - Present Course syllabus valid: Autumn 2021 Sp 1 - PresentThe version indicates the term and period for which this course syllabus is valid. The most recent version of the course syllabus is shown first. Education level Second cycle Grade scale U G VG Subject Information Technology Subject group (SCB) Computer Technology Main field of study Information Systems Sciences, Computer Science and Engineering Entry requirementsIn order to meet the general entry requirements for the course, you must have accomplished a minimum of 180 ECTS of university studies, out of which 60 ECTS in the areas of computer or system science. The studies shall have included Introductory Programming (for example D0009E or D0007N) and Fundamentals of Databases (for example D0004N or D0018E). The Text Mining course also requires a basic data mining course (for example D0025E Data Mining). Good knowledge in English equivalent to English 6. More information about the English language requirements [http://www.ltu.se/edu/bli-student/Application-process/English-language-requirements-1.109316?l=en]. More information about English language requirements SelectionThe selection is based on 20-285 creditsCourse Aim The objective of the course is for the student to develop their knowledge and skills in Text Mining. After passing the course, the student should be able to: [1]. Explain and use text preprocessing techniques [2]. Describe a text analytics system together with its components, optional and mandatory ones [3]. Explain how text could be analyzed [4]. Evaluate results of text analytics [5]. Analyze and reflect on the various techniques used in text analytics and the parameters needed as well as the problem solved [6]. Plan & execute a text analytics experiment Contents The Text Mining course is focusing on the importance and the difficulty of analyzing text. The Text Mining course is designed to provide students with knowledge relevant to both preprocessing of text as well as analytics of text. The Text Mining course, however, focuses on wide range of algorithms, techniques, and tools. These include standard methods, such as: tokenization, TF-IDF, n-grams, Named Entity Extraction, Sentiment Analysis, and Topic Modeling. Furthermore, recent trends in machine learning and deep learning are also covered, including: Word2Vec, Semantic Hashing, and Recurrent Neural Networks for Natural Language Processing. Various examples and use cases are used across the course. Realization Each course occasion´s language and form is stated and appear on the course page on Luleå University of Technology's website. Lectures, labs, assignments, case studies and/or project work. During the course, the students work with individual tasks and/or group work. Some assignments or case studies in the course might contain work in contact with or about the industry. The student uses different methods and techniques, and it is important to choose the right method, technique or computer support for each task. Before and after the tasks are solved, there are lectures to present and discuss different solutions. Teaching is in English and on the Internet for distance students or on campus for students living here. IT support: Learning management system, e-mail and phone. The learning management system is used for delivering course material, information and submissions. Knowledge is shared and created within the course through virtual meetings with teachers and other students for discussions, supervision, teamwork and seminars. For students on campus, there will be meetings on campus. Examination If there is a decision on special educational support, in accordance with the Guideline Student's rights and obligations at Luleå University of Technology, an adapted or alternative form of examination can be provided. Through written tests, individual and group/project assignment, different student abilities are examined. Those are: the ability to explain and use text mining techniques and the ability to address business problems using text mining individually and in groups. Remarks Technical Requirements: access to computer with administrative rights, web camera, microphone and Internet connection. ExaminerMarcus LiwickiLiterature. Valid from Autumn 2020 Sp 1 (May change until 10 weeks before course start)Title: Practical Text Analytics: Interpreting Text and Unstructured Data for Business Intelligence (Marketing Science)Author: Steven StruhlPublisher: Kogan Page, 1st edition, 2015ISBN: 978-07499474010 Title: Deep Text: Using Text Analytics to Conquer Information Overload, Get Real Value from Social Media, and Add Bigger Text to Big DataAuthor: Tom ReamyPublisher: Information Today Inc, 2016ISBN 9781573875295 Title: Text Mining and Analysis: Practical Methods, Examples, and Case Studies Using SASAuthor: Goutam Chakraborty, Murali Pagolu, & Satish GarlaPublisher: SAS Institute, 2014ISBN 9781612905518 Search books in the library » Course offered byDepartment of Computer Science, Electrical and Space EngineeringModules CodeDescriptionGrade scaleHPStatusFrom periodTitle 0002Individual taskU G#1.50MandatoryA20 0003Group-/Project workU G#2.00MandatoryA20 0004Written exam/Individual examU G VG4.00MandatoryA21 Study guidanceStudy guidance for the course is to be found in our learning platform Canvas before the course starts. Students applying for single subject courses get more information in the Welcome letter. You will find the learning platform via My LTU.Syllabus establishedby Jonny Johansson, HUL SRT 21 Feb 2020Last revisedby Jonny Johansson, HUL SRT 17 Feb 2021