Member-only story
Scrape & Summarize News Articles Using Python
In this article I will show you how to collect and scrape news data from different sources in a unified way using the python packages newspaper3k and nltk. The documentation for newspaper3k has all of the methods and information that you need to get started using this package. You can see the source code at https://github.com/codelucas/newspaper.
I’m a big advocate of not reinventing the wheel, and the newspaper3k package makes it very easy and simple to extract information from the web.
If you prefer not to read this article and would like a video representation of it, you can check out the YouTube Video below it goes through everything in this article with a little more detail, and will help make it easy for you to start programming in Python even if you don’t have it installed on your computer.
Programming:
The first thing that I like to do before writing a single line of code is to put in a description in comments of what the code does. This way I can look back on my code and know exactly what it does.