Skip to content

A simple webcrawler that prints a list of all the Articles on the first X Pages of hackernews (https://news.ycombinator.com/news) with more than Y votes.

Notifications You must be signed in to change notification settings

keshavnath/hackernews-webcrawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

hackernews-webcrawler

A simple webcrawler that prints a list of all the Articles on the first X Pages of hackernews with more than Y votes.

Inputs, in the form of sys.argv[1] and sys.argv[2], are X and Y respectively, where X is the number of pages (starting from Page 1) to be searched for News Articles. Articles with more than or equal to Y votes will be displayed.

Display format:-

Title: (Title) Link: (Link) Votes: (Votes)

Simple program uses BeautifulSoup4 and Requests to get the HTML of the page and extract the useful information.

A practice project.

About

A simple webcrawler that prints a list of all the Articles on the first X Pages of hackernews (https://news.ycombinator.com/news) with more than Y votes.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages