What do you do when you need to migrate content from one blog to another?
Ideally, you use an API. But if the origin blog doesn’t provide one, you may need to build your own script to scrape the pages and get all the content in the desired format.
If you’re scraping static HTML pages that share a similar structure, you can use Cheerio and Node.js to extract the content and output it as JSON.
This is not the only option, but it’s probably the easiest if you already know a bit of JavaScript and you’re not dealing with dynamic content. For more complex pages that render their content with JavaScript, you may want to use a tool like Puppeteer.

#html

Cheerio Script for Turning HTML Pages Into JSON Files