Mining the Social Web - Web Pages
MP4 | Video: AVC 1280x720 | Audio: AAC 44KHz 2ch | Duration: 32M | 332 MB
Genre: eLearning | Language: English
How do software programs that automatically extract information from web pages actually work? This video course, based on content from the book "Mining the Social Web" (O'Reilly Media) by Matthew Russell, teaches you how to create machines that can navigate the internet, cut through the noise, and extract the most important textual content from any web page or group of web pages. You'll learn how to use Python to write programs that can crawl, scrape, and parse the web. You'll also discover how to extract key terms and sentences from web-mined documents, explore document summarization techniques used in natural language processing and artificial intelligence, and gain experience using Python’s Natural Language Toolkit (NLTK) to auto-summarize web articles. Learners should have a basic understanding of Python.
Understand how helpful (and malicious) web bots crawl, parse, and index the web
Learn how to scrape content, extract links, and parse information from web pages and blog feeds
Discover how Python’s Natural Language Toolkit (NLTK) extracts and summarizes content
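To give a rough sense of the link-extraction step mentioned above, here is a minimal sketch that uses only Python's standard library. It is not taken from the course or the book: the class name LinkExtractor, the helper extract_links, the sample HTML, and the example.com URLs are all illustrative assumptions.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href of every <a> tag, resolved against a base URL.

    Hypothetical helper for illustration; not part of the course material.
    """

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the opening tag.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Turn relative links into absolute URLs.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return all absolute link targets found in an HTML string."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    parser.close()
    return parser.links


# Made-up page fragment and base URL, used only to exercise the sketch.
SAMPLE_HTML = (
    '<p><a href="/about">About</a> '
    '<a href="https://example.org/feed">Feed</a></p>'
)
links = extract_links(SAMPLE_HTML, "https://example.com/blog/")
print(links)
```

A crawler would typically fetch each discovered URL in turn and repeat this step, while respecting the site's robots.txt and rate limits.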