Introduction
This course introduces the statistical processing of natural language texts, particularly counting words and phrases on their own and measuring associations between them using correlations and other measures. Goals of text mining include document classification, information retrieval, source authentication, and stylistic categorization. Typical document sources are newspaper stories, email captures, and Internet pages, as well as collections of non-fiction and fiction such as the Federalist Papers and Edgar Allan Poe's short stories.
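As a rough illustration of the word and phrase counting described above, the following minimal Python sketch tallies single words and adjacent word pairs (bigrams). The function names, the simple regular-expression tokenizer, and the sample sentence are all assumptions made for this example, not course material.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens (illustrative tokenizer)."""
    return re.findall(r"[a-z']+", text.lower())


def count_words_and_bigrams(text):
    """Return (word counts, bigram counts) for the given text."""
    tokens = tokenize(text)
    words = Counter(tokens)
    # Pair each token with its successor; note that pairs may span sentence boundaries.
    bigrams = Counter(zip(tokens, tokens[1:]))
    return words, bigrams


# Made-up sample text, used only to exercise the functions.
sample = "The raven sat. The raven spoke. Nevermore, said the raven."
words, bigrams = count_words_and_bigrams(sample)
print(words.most_common(2))
print(bigrams[("the", "raven")])
```

Counts like these are the raw material for association measures: a pair that occurs far more often than its individual word frequencies would suggest is a candidate phrase.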
Teacher
David Pfeffer
Adjunct Professor
Email: dpfeffer@stevens.edu
Education
Bachelor of Science in Computer Science
Stevens Institute of Technology
Hoboken, NJ
Honors; Minor in Law and Public Policy
Master of Science in Computer Science
Stevens Institute of Technology
Hoboken, NJ
4.0 GPA; Graduate Certificates in Computer Systems, Databases & Service Oriented Architecture, Distributed Systems, Enterprise Computing, Quantitative Software Engineering, Service Oriented Computing
Professional Societies
Member of the IEEE Computer Society and the ACM.
Courses
CS 521 TCP/IP Networking
CS 570 Introduction to Programming, Data Structures, and Algorithms
CS 465 Selected Topics in Computer Science
SSW 810 Selected Topics in Systems Centric Software Engineering