Text Purify API is designed to transform the way you interact with web content, providing a robust and efficient solution for extracting relevant text from articles and web pages. In a world flooded with information, this API becomes an essential tool for users looking to get clean, meaningful data without the clutter of ads, menus and other unwanted elements.
The Text Purify API is a cloud-based service that allows users to extract the core content of web articles with high accuracy. This API is ideal for applications that require the collection and analysis of content from news, blogs, research and more. It uses advanced natural language processing (NLP) and machine learning techniques to identify and extract relevant text, ensuring that only valuable information is delivered to the user. The API is equipped with sophisticated algorithms that recognise and extract the main body text of a web page. This includes identifying the main text of articles and automatically excluding ads, menus, sidebars and other non-essential elements.
It can handle a wide variety of web page formats and layout styles, ensuring that content extraction is effective regardless of website design. The API is designed to work with content in different languages, making it versatile for global applications. A simple and well-documented application programming interface (API) is provided, making it easy to integrate with your existing applications and workflows. The API provides fast responses, which is crucial for real-time applications and large-scale data analysis. This enables a smooth and efficient user experience.
The Text Purify API receives a URL and optional settings, and provides clean text of the article, excluding ads, along with metadata such as title and author.
Uses the API to extract the main text of articles from multiple news sources and present them in a unified platform, improving the user experience by avoiding ads and irrelevant content.
Facilitates the collection of information from academic and research articles, allowing researchers to extract the essential content for analysis and review without the distractions of advertising.
Create applications that generate concise summaries of web articles by extracting only the main, relevant content, offering users more digestible versions of long texts.
Enables content curators to extract and present only the most relevant text from articles and publications, ensuring their audiences receive high quality information without distracting elements.
Extracts relevant content from online reviews and articles to perform sentiment analysis, helping companies better understand public perception of their products or services.
Basic Plan: 50 requests per minute.
Pro Plan: 100 requests per minute.
Pro Plus Plan: 240 requests per minute.
Premium Plan: 360 requests per minute.
{"error":0,"message":"Article extraction success","data":{"url":"https://ellzey.house.gov/2024/10/congressman-jake-ellzey-s-statement-on-fema-aid","title":"Congressman Jake Ellzey's Statement on FEMA Aid","description":"The Department of Homeland Security, under Secretary Mayorkas, has taken actions that make illegal immigration more attractive by reallocating funds that should be prioritized for disaster relief efforts. At...","links":["https://ellzey.house.gov/2024/10/congressman-jake-ellzey-s-statement-on-fema-aid"],"image":"https://ellzey.house.gov/vendor/_accounts/jakeellzey/_skins/062422/images/social_card.png","content":"<div>\n<article>\n<a></a>\n<div><p>The Department of Homeland Security, under Secretary Mayorkas, has taken actions that make illegal immigration more attractive by reallocating funds that should be prioritized for disaster relief efforts. At a time when FEMA is warning that they do not have enough funding to cover the rest of the hurricane season, money has been funneled into programs that provide aid to noncitizen migrants.</p>\r\n<p>Over $1 billion has been directed to programs like the Shelter and Services Program (SSP) and the Emergency Food and Shelter Program, which have been repurposed to support illegal immigrants. With 150,000 households already relying on FEMA aid after devastating hurricanes, this is a gross misallocation of resources.</p>\r\n<p>The current Administration needs to stop diverting taxpayer money to initiatives that encourage illegal immigration and instead focus on supporting the American people and their immediate needs during natural disasters.</p>\r\n<p>Here is what we know: </p>\r\n<ul>\r\n<li>Homeland Security Secretary Alejandro Mayorkas said Federal Emergency Management Agency (FEMA) can meet immediate needs but does not have enough funds for the rest of Hurricane season.</li>\r\n<ul>\r\n<li>Congress recently granted $20 Billion for FEMA’s disaster relief fund as part of the September continuing resolution.</li>\r\n<li>The Biden Administration has granted North Carolina additional aid in the recovery effort with a 100 percent federal cost share for debris removal and emergency protective measures for six months.</li>\r\n<li>150,000 households have registered for FEMA aid.</li>\r\n</ul>\r\n<li>The Shelter and Services Program (SSP) administered by FEMA provides financial support to non-federal agencies to provide humanitarian services to “noncitizen migrants.”</li>\r\n<ul>\r\n<li>FEMA, on their website, said they have funneled at least $1 billion into the program between FY23 and FY24.</li>\r\n<li>New York City’s Department of Homeless Services has given $4,000 in grants to 150 families to help illegal immigrants settle into permanent homes.</li>\r\n<li>The Emergency Food and Shelter Program, also under FEMA, was repurposed into a fund for Illegal immigrants. Many of these funds went to Catholic Charities on the border, totaling $13,937,331 in 2023.</li>\r\n</ul>\r\n</ul>\r\n<ul>\r\n<li>Secretary Mayorkas’ response is that SSP is a separate appropriated account from disaster relief and is not associated with those funding streams.</li>\r\n<ul>\r\n<li>On FEMA’s website, they claim, “No money is being diverted from disaster response needs. FEMA’s disaster response efforts and individual assistance are funded through the Disaster Relief Fund, which is a dedicated fund for disaster efforts. Disaster Relief Fund money has not been diverted to other, non-disaster related efforts.”</li>\r\n<li>The December 2022 consolidated funding bill authorizing the split-off program for spending on migrants vaguely described the purpose as for “providing shelter and other services to families and individuals encountered by the Department of Homeland Security.”</li>\r\n</ul>\r\n</ul>\n<p>######</p></div>\n</article>\n</div>","author":"@RepEllzey","favicon":"https://ellzey.house.gov/vendor/_accounts/jakeellzey/_skins/062422/images/favicon.ico","source":"ellzey.house.gov","published":"2024-10-07T04:00:00Z","ttr":86,"type":"article"}}
curl --location --request GET 'https://pr197-testing.zylalabs.com/api/4949/text+purify+api/6229/article+extract?url=https://css-tricks.com/empathetic-animation/&word_per_minute=300&desc_truncate_len=210&desc_len_min=180&content_len_min=200' --header 'Authorization: Bearer YOUR_API_KEY'
{"error":0,"message":"Article extraction success","data":{"url":"https://cryptobriefing.com/fidelity-ethereum-etf-dtcc-listing/","title":"Fidelity's Ethereum spot ETF listed on DTCC under ticker $FETH","description":"Fidelity's spot Ethereum fund is now listed on DTCC under ticker $FETH following SEC's approval of multiple Ethereum ETFs.","links":["https://cryptobriefing.com/fidelity-ethereum-etf-dtcc-listing/"],"image":"https://static.cryptobriefing.com/wp-content/uploads/2024/05/29232455/img-HBnmOBf0yYWOnnbZiut1I8BO-800x457.jpg","content":"<div>\n <section>\n <h2>SEC's approval process for Ethereum ETFs underway, trading awaits S-1 filings.</h2>\n </section>\n <section>\n <picture>\n <source media=\"(min-width: 850px)\" srcset=\"https://static.cryptobriefing.com/wp-content/uploads/2024/05/29232455/img-HBnmOBf0yYWOnnbZiut1I8BO-800x457.jpg\"></source>\n <img src=\"https://static.cryptobriefing.com/wp-content/uploads/2024/05/29232455/img-HBnmOBf0yYWOnnbZiut1I8BO-400x228.jpg\" alt=\"Fidelity's spot Ethereum ETF listed on DTCC under ticker $FETH\" title=\"Fidelity’s spot Ethereum ETF listed on DTCC under ticker $FETH\" />\n </picture>\n </section>\n <section>\n <p>Fidelity’s Ethereum spot ETF has been listed on the Depository Trust and Clearing Corporation (DTCC) under the ticker symbol $FETH. This development comes on the heels of the US Securities and Exchange Commission’s (SEC) <a href=\"https://cryptobriefing.com/sec-ethereum-etf-approval/\" target=\"_blank\">approval of spot Ethereum exchange-traded funds</a> (ETFs) on May 23.</p><figure><img src=\"https://static.cryptobriefing.com/wp-content/uploads/2024/05/29225708/Fidelity-Ethereum-ETF-on-DTCC.jpg\" /><figcaption>Fidelity’s Ethereum spot ETF is now listed on <a href=\"https://www.dtcc.com/products/cs/exchange_traded_funds_plain_new.php\" target=\"_blank\">DTCC</a></figcaption></figure><p>BlackRock’s Ethereum fund, iShares Ethereum Trust, is listed on the DTCC <a href=\"https://cryptobriefing.com/blackrock-ethereum-etf-dtcc/\" target=\"_blank\">under ticker $ETHA</a>. VanEck’s Ethereum ETF is listed <a href=\"https://cryptobriefing.com/vaneck-dtcc-ethereum-etf-listing/\" target=\"_blank\">under ticker $ETHV</a> and Franklin Templeton’s <a href=\"https://cryptobriefing.com/franklin-templeton-ethereum-etf-dtcc-listing/\" target=\"_blank\">under ticker $EZET</a>.</p><p>The SEC’s acceptance of the 19b-4 forms for the spot Ethereum ETFs marks a major step, although the commencement of trading awaits the approval of each ETF’s S-1 filing.</p><p>Discussions between the SEC and ETF issuers about the S-1 forms are reportedly <a href=\"https://cryptobriefing.com/sec-engages-ethereum-etf-issuers-s-1-forms/\" target=\"_blank\">underway</a>. However, the timeframe for the trading approval is uncertain, with projections ranging from weeks to months.</p><p>VanEck was among the first to submit an amended S-1 form on May 23, with BlackRock following suit with an <a href=\"https://cryptobriefing.com/blackrock-ethereum-etf-launch/\" target=\"_blank\">updated S-1 filing</a> today. The S-1 form serves as an initial registration document that must be filed with the SEC before a security can be offered to the public.</p>\n </section>\n <section>\n <a href=\"https://cryptobriefing.com/disclaimer/\" target=\"_blank\">\n Disclaimer </a>\n </section>\n</div>","author":"@crypto_briefing","favicon":"https://static.cryptobriefing.com/wp-content/uploads/2020/02/02093517/ios-144.png","source":"cryptobriefing.com","published":"2024-05-30T17:14:47+00:00","ttr":40,"type":"article"}}
curl --location --request GET 'https://pr197-testing.zylalabs.com/api/4949/text+purify+api/6230/article+proxy+extract?url=https://cryptobriefing.com/fidelity-ethereum-etf-dtcc-listing/&word_per_minute=300&desc_truncate_len=210&desc_len_min=180&content_len_min=200' --header 'Authorization: Bearer YOUR_API_KEY'
| Header | Description |
|---|---|
Authorization
|
[Required] Should be Bearer access_key. See "Your API Access Key" above when you are subscribed. |
No long-term commitment. Upgrade, downgrade, or cancel anytime. Free Trial includes up to 50 requests.
Use the API by providing a URL to extract the main content of the article. Set optional parameters to customise the extraction and formatting.
The Text Purify API cleans and extracts relevant text from web pages, removing ads and unwanted content, providing only the main text of the article.
There are different plans suits everyone including a free trial for small amount of requests, but it’s rate is limit to prevent abuse of the service.
Zyla provides a wide range of integration methods for almost all programming languages. You can use these codes to integrate with your project as you need.
The API returns detailed information about the age and history of a domain, including years, months and days since its creation, as well as expiration and update dates.
The GET Article Extract endpoint returns the main content of an article, including the title, description, content, and metadata like the URL and image. The GET Article Proxy Extract endpoint provides similar data but through a proxy for restricted sites.
Key fields in the response include "url" (the article's link), "title" (the article's title), "description" (a brief summary), "content" (the main text), and "image" (a relevant image URL).
The response data is structured in JSON format, with an "error" field indicating success or failure, a "message" field for status updates, and a "data" object containing the extracted article details.
Parameters include "word_per_minute" for reading speed, "desc_truncate_len" for maximum description length, "desc_len_min" for minimum description length, and "content_len_min" for minimum content length.
Users can customize requests by adjusting optional parameters to control reading speed, description length, and content length, allowing for tailored output based on specific needs.
Each endpoint provides the main article text, title, description, image, and links, enabling users to access comprehensive content without ads or irrelevant elements.
Data accuracy is maintained through advanced natural language processing and machine learning techniques that identify and extract relevant content while filtering out ads and non-essential elements.
Typical use cases include content curation, academic research, sentiment analysis, and creating summaries of articles, allowing users to focus on essential information without distractions.
To obtain your API key, you first need to sign in to your account and subscribe to the API you want to use. Once subscribed, go to your Profile, open the Subscription section, and select the specific API. Your API key will be available there and can be used to authenticate your requests.
You can’t switch APIs during the free trial. If you subscribe to a different API, your trial will end and the new subscription will start as a paid plan.
If you don’t cancel before the 7th day, your free trial will end automatically and your subscription will switch to a paid plan under the same plan you originally subscribed to, meaning you will be charged and gain access to the API calls included in that plan.
The free trial ends when you reach 50 API requests or after 7 days, whichever comes first.
No, the free trial is available only once, so we recommend using it on the API that interests you the most. Most of our APIs offer a free trial, but some may not include this option.
Yes, we offer a 7-day free trial that allows you to make up to 50 API calls at no cost, so you can test our APIs without any commitment.
Zyla API Hub is like a big store for APIs, where you can find thousands of them all in one place. We also offer dedicated support and real-time monitoring of all APIs. Once you sign up, you can pick and choose which APIs you want to use. Just remember, each API needs its own subscription. But if you subscribe to multiple ones, you'll use the same key for all of them, making things easier for you.
Please have a look at our Refund Policy: https://zylalabs.com/terms#refund
Service Level:
100%
Response Time:
127ms
Service Level:
100%
Response Time:
1,191ms
Service Level:
100%
Response Time:
227ms
Service Level:
100%
Response Time:
263ms
Service Level:
100%
Response Time:
4,048ms
Service Level:
91%
Response Time:
2,513ms
Service Level:
100%
Response Time:
2,466ms
Service Level:
100%
Response Time:
1,537ms
Service Level:
100%
Response Time:
0ms
Service Level:
100%
Response Time:
166ms
Service Level:
100%
Response Time:
87ms
Service Level:
100%
Response Time:
2,053ms
Service Level:
100%
Response Time:
93ms
Service Level:
100%
Response Time:
955ms
Service Level:
100%
Response Time:
167ms
Service Level:
100%
Response Time:
10ms
Service Level:
100%
Response Time:
78ms
Service Level:
100%
Response Time:
85ms
Service Level:
100%
Response Time:
724ms
Service Level:
100%
Response Time:
97ms