Is there any free scraping package/tool etc. in Python which can crawl a WordPress site and skip the ad block?

Davin kARIN

New member
Joined
Dec 13, 2024
Messages
1
Points
0
I want to keep the original HTML and only remove the ad block.
Is there any existing solution for this purpose?
 

Ni enow

New member
Joined
Dec 13, 2024
Messages
1
Points
0
Not sure of a pre-built library... Newspaper is OK for Python, but it hasn't been updated in years, so YMMV.

Most WordPress sites don't disable access to the wp-json v2 API, so if you want to get rid of a lot of website fluff and make fewer requests to the server, just put
wp-json/wp/v2/posts (or pages)
at the end of the homepage URL and you'll be able to see everything. You can read more about it in the WordPress REST API documentation.
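
Here's a rough sketch of that in Python, standard library only. The helper names (posts_endpoint, extract_html, fetch_posts), the User-Agent string, and https://example.com are placeholders I made up for illustration. It assumes the site leaves the API enabled. Each item's content.rendered field is the post body as HTML, usually without the theme's header and sidebars, though ads that a plugin injects into the post body itself may still be in there.

```python
import json
from urllib.parse import urljoin
from urllib.request import Request, urlopen


def posts_endpoint(home_url, kind="posts"):
    """Append wp-json/wp/v2/<kind> to the homepage URL (kind: 'posts' or 'pages')."""
    base = home_url if home_url.endswith("/") else home_url + "/"
    return urljoin(base, f"wp-json/wp/v2/{kind}")


def extract_html(items):
    """Keep only the link, title and rendered HTML body of each returned item."""
    return [
        {
            "link": item.get("link"),
            "title": item.get("title", {}).get("rendered"),
            "html": item.get("content", {}).get("rendered"),
        }
        for item in items
    ]


def fetch_posts(home_url, kind="posts", per_page=10, page=1):
    """Fetch one page of results; raises an HTTPError if the API is disabled."""
    url = f"{posts_endpoint(home_url, kind)}?per_page={per_page}&page={page}"
    # Hypothetical User-Agent; some servers reject the urllib default.
    req = Request(url, headers={"User-Agent": "wp-json-example/0.1"})
    with urlopen(req, timeout=30) as resp:
        return extract_html(json.load(resp))


# Usage (placeholder URL, swap in the real homepage):
# for post in fetch_posts("https://example.com"):
#     print(post["link"], post["title"])
```

Bump the page argument to walk through older posts; per_page is capped at 100 by the API.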

Hope this helps.
 