Parse & categorize any online news article written in English

The most advanced article extraction API, with category prediction, auto-generated summaries, image extraction, and more.

Automatic text categorization using Machine Learning

Extracts Full HTML/Text content

A.I.-predicted categories

Summary generation using NLP & A.I.

Extracts all images from the article

What you can extract from a URL

Article text

The main headline and body of the article or news.

Date & Time Published

Know the exact time the article was published.

Media Links

A list of URLs of all media inside the article body.

Article's Topic

Know what each article is about

Get the clean HTML of the article

Top keywords from the text
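
As a rough illustration, the fields listed above could be modeled on the client side like this. It is a minimal Python sketch under stated assumptions: the class name, the field names, and the JSON keys are all made up for illustration and are not Pipfeed's documented response schema.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional


@dataclass
class ExtractedArticle:
    """Hypothetical container for the fields described above."""

    title: str
    text: str
    published_at: Optional[str] = None  # date and time published, if detected
    media_links: List[str] = field(default_factory=list)
    topic: Optional[str] = None
    html: Optional[str] = None  # clean HTML of the article
    keywords: List[str] = field(default_factory=list)


def article_from_response(payload: Dict[str, Any]) -> ExtractedArticle:
    """Build an ExtractedArticle from a decoded JSON response.

    The key names used here are assumptions, not the real API's keys.
    """
    return ExtractedArticle(
        title=payload.get("title", ""),
        text=payload.get("text", ""),
        published_at=payload.get("published_at"),
        media_links=list(payload.get("media_links", [])),
        topic=payload.get("topic"),
        html=payload.get("html"),
        keywords=list(payload.get("keywords", [])),
    )
```

Missing fields fall back to empty values, so downstream code can rely on every attribute being present even when an article has no media or no detected date.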

Don't let your growth slow you down

Perfect for Consumer-Facing Apps



All API requests are cached for 7 days by default, so the API is ready to integrate with no extra work.


Low Latency

The lowest latency in its class for an extract API.



Your app went viral? Don’t worry! The API scales flawlessly to millions of API calls.

Using Machine Learning For Topic Detection

We’ve trained our own model, which detects more than 50 topics.

Just what you need

All juice—no pulp. The Article feature removes extraneous information from blog posts and articles, leaving you with just the content you want. Stripped of unneeded info, articles display the way you prefer, making your site look great.



The API also returns a summary of the given article, using NLP models to select the most relevant sentences.
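
To make the idea of selecting the most relevant sentences concrete, here is a small Python sketch of frequency-based extractive summarization. It is not Pipfeed's model, which is not described here. It only illustrates the general technique, and the function names and the stopword list are assumptions chosen for the example.

```python
import re
from collections import Counter
from typing import List

# Tiny illustrative stopword list; a real system would use a fuller one.
STOPWORDS = {
    "the", "a", "an", "and", "or", "of", "to", "in", "is", "it",
    "that", "this", "for", "on", "with", "as", "are", "was",
}


def split_sentences(text: str) -> List[str]:
    """Naive sentence splitter: break after '.', '!' or '?'."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def content_words(text: str) -> List[str]:
    """Lowercased words with stopwords removed."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]


def summarize(text: str, max_sentences: int = 2) -> str:
    """Keep the highest-scoring sentences, in their original order."""
    sentences = split_sentences(text)
    freq = Counter(content_words(text))

    def score(sentence: str) -> float:
        tokens = content_words(sentence)
        if not tokens:
            return 0.0
        # Average document-level frequency of the sentence's words.
        return sum(freq[t] for t in tokens) / len(tokens)

    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])
    return " ".join(sentences[i] for i in keep)
```

Each sentence is scored by how often its words appear in the whole article, so sentences that repeat the article's main terms rank highest.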

Pipfeed's extract API

Avoid months of development time building your own URL extractor and use Pipfeed’s API today.

Everything you need from a News Extract API

Extracts full HTML/Text

Using A.I., we extract the full HTML even from JavaScript-heavy websites.

Consistent Categories

Get auto-predicted categories to better organize your extracted content.


Get full metadata of the article including images, keywords, tags, and more.


Avg. Latency: 1 sec



20 API calls/day


50,000 API calls
  • 10K API/month


50K API calls/month

Latest from our Blog

Tutorials & Use Cases for the News Extract API

Generate Embeddable HTML code for any URL using Pipfeed's Extract API

Embeddable cards provide a clean, responsive, and shareable card for any content on the web. Cards are the easiest way to leverage Pipfeed’s extract API for any media. 40% of users will click, hover, or view cards with videos, images, and rich media. Cards are responsive and automatically adapt to fit any site they are placed in.

But many embed APIs aren’t very customizable and usually result in longer load times. Using Pipfeed’s extract API, you can generate pure HTML code in the framework and style of your choice. For this example, we will use Bootstrap cards to style the generated cards.
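
A card of this kind might be generated as in the following Python sketch. It is not the code from the blog post: the function name and its parameters are hypothetical, and the title, summary, URL, and image are assumed to come from an extract response. The CSS classes are standard Bootstrap card classes.

```python
from html import escape
from typing import Optional


def render_card(title: str, summary: str, url: str, image_url: Optional[str] = None) -> str:
    """Return a Bootstrap card as an HTML string, escaping all inputs."""
    image_html = ""
    if image_url:
        # The image is optional; the title doubles as its alt text.
        image_html = (
            f'  <img src="{escape(image_url)}" class="card-img-top" '
            f'alt="{escape(title)}">\n'
        )
    return (
        '<div class="card" style="width: 18rem;">\n'
        f"{image_html}"
        '  <div class="card-body">\n'
        f'    <h5 class="card-title">{escape(title)}</h5>\n'
        f'    <p class="card-text">{escape(summary)}</p>\n'
        f'    <a href="{escape(url)}" class="btn btn-primary">Read article</a>\n'
        "  </div>\n"
        "</div>"
    )
```

Escaping every value keeps markup found in an article title or summary from being injected into the host page. The sketch does not validate URL schemes, so a real integration would want to check those as well.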

Our Tech

Pipfeed’s API runs on AWS and is made by ex-AWS software engineers.

This same API powers the Pipfeed mobile app used by thousands of daily readers.
