Market Research · Content Teams

AI Agent for Fetching latest RSS articles and storing non-duplicates in Google Sheets

Monitors RSS feeds, validates feeds, fetches the latest articles, deduplicates against existing records, and stores clean results in Google Sheets.

How it works
1 Step
Prepare & Validate URLs
2 Step
Fetch Latest Articles
3 Step
Deduplicate & Store
Collects RSS feed URLs and validates which feeds are technically reachable and provide valid RSS content.

Overview

This AI agent automatically collects RSS feed URLs, validates them, fetches the latest articles, and stores unique items in Google Sheets. It operates end-to-end from feed discovery to record storage, consolidating your content in a single, auditable sheet. It reduces manual checks and provides a verifiable audit trail for curations.

# RSS Feed Latest Article Fetcher Workflow This n8n workflow fetches URLs from an RSS feed, checks which URLs have a valid RSS feed and if true, fetches the latest articles from those URLs. It then stores the article details, including the article link, in Google Sheets. ### Quick Implementation Steps 1. Import this JSON workflow into n8n. 2. Connect your application or RSS feed URLs to be checked. 3. Add Google Sheets API credentials to store the fetched articles. 4. Enable the workflow — done! ## What It Does 1. **Fetches RSS Feed URLs** - Retrieves a list of URLs from an RSS feed source. 2. **Checks Each URL for RSS Feed** - Verifies which URLs contain a valid RSS feed. 3. **Fetches Latest Articles** - For URLs with valid RSS feeds, it fetches the latest articles from those feeds. 4. **Stores Articles in Google Sheets** - Adds the article data, including article links, to a Google Sheet for record-keeping. ## Who’s It For This workflow is ideal for: - Content aggregators looking to fetch articles from multiple RSS feeds. - Teams needing to automate article collection and storage for easy reference. - Organizations aiming to centralize their content curation process in Google Sheets. ## Requirements - n8n (Self-Hosted or Cloud) - Google Sheets API credentials with permissions to append data - A list of RSS feed URLs to check ## How It Works 1. **Trigger**: Workflow is triggered by clicking "Execute Workflow". 2. **Fetch RSS URLs**: Retrieves a list of URLs to check for RSS feeds. 3. **Check for RSS Feed**: Validates whether the URLs have a valid RSS feed. 4. **Fetch Latest Articles**: For valid RSS feeds, the workflow fetches the latest articles. 5. **Store in Sheets**: The fetched articles, along with their links, are added to a Google Sheet. 6. **Stop if No Articles Found**: If no valid articles are found, the workflow automatically stops. ## Setup Instructions 1. **Import Workflow**: _Workflows → Import from File_ in n8n. 2. **RSS Feed URL Setup**: Input the URLs you want to check for RSS feeds. 3. **Google Sheets Setup**: Connect your Google Sheets account and configure the spreadsheet where articles will be stored. 4. **Run the Workflow**: Trigger the workflow manually by clicking "Execute Workflow" and monitor the results in your Google Sheet. ## Logic Overview | Step | Node Description | |----------------------------------------|----------------------------------------------------------------------------------| | **1. Trigger** | **Execute Workflow**: Initiates the workflow process when triggered. The workflow begins when you manually execute it. | | **2. Fetch RSS URLs** | **RSS Feed**: Retrieves a list of RSS URLs from Google Sheets. | | **3. Check for RSS Feed** | **Check RSS URLs**: Verifies whether each URL has a valid RSS feed using an HTTP Request node. | | **4. Fetch Latest Articles** | **Fetch Each URL**: Fetches the latest articles from valid RSS feeds using the RSS Feed Read node. | | **5. Fetch Existing Titles** | **Fetch Existing Title**: Retrieves stored titles from Google Sheets for comparison. | | **6. Check for Duplicates** | **Check if Title Exists**: Compares new titles with existing ones. Duplicates are skipped; new ones proceed. | | **7. Store Only Non-Duplicate Titles** | **Append Row in Sheets**: Stores new (non-duplicate) articles in Google Sheets. | | **8. Optional Message Node** | **Message**: Logs duplicate events (e.g., "Data already exists"). | | **9. Stop if No Articles Found** | **Stop**: Halts workflow if no new articles are found or added. | ## Customization Options - **RSS URL List**: Adjust the list of URLs to fetch articles from. - **Google Sheets**: Modify the Google Sheets integration to store additional data (e.g., article content, author). - **Error Handling**: Add custom error messages if no RSS feed is found or no articles are fetched. ## Troubleshooting - **No Articles Fetched**: Ensure the RSS feed URLs are correctly formatted and active. - **Google Sheets Issues**: Verify that the Google Sheets API credentials are correctly configured and the sheet is accessible. - **Workflow Not Triggering**: Check the execution trigger settings to ensure the workflow is activated manually. ## Use Case Examples 1. **Blog Aggregator**: Automatically fetch and store the latest blog articles from multiple RSS feeds into a central Google Sheet for analysis. 2. **Content Curation**: Gather the latest articles from industry-related RSS feeds and share them with your team. ## Need Help? If you need help setting up, customizing or extending this workflow, WeblineIndia can assist with full [n8n development](https://www.weblineindia.com/n8n-automation/), workflow automation, content fetching systems and custom integrations.


Capabilities

What RSS Article Fetcher does

A concise explanation of the end-to-end automation the agent performs.

01

Fetches RSS feed URLs from a source list.

02

Checks each URL for a valid RSS feed.

03

Fetches the latest articles from valid RSS feeds.

04

Deduplicates articles against existing Google Sheets entries.

05

Appends non-duplicate articles to Google Sheets.

06

Logs duplicate or failed fetch events for auditing.

Why you should use AI Agent for Fetching RSS Articles and Storing Non-Duplicates in Google Sheets

Two sentences explain the practical rationale for use. The before/after contrasts below illustrate how the workflow changes. It automates repetitive steps and provides a reliable record of captured content.

Before
Manual RSS URL collection is inconsistent and time-consuming.
Duplicates are often added due to lack of dedup logic.
RSS feed validation is manual and error-prone.
Storing articles in sheets is fragmented and hard to audit.
No automated auditing or error logging for failed fetches.
After
All valid RSS articles are captured automatically in Google Sheets.
Only non-duplicate articles are stored, ensuring clean data.
Feed validation runs automatically, reducing broken feeds.
Error logging provides traceable issues for quick fixes.
Workflow can be re-run on demand with consistent results.
Process

How it works

A simple 3-step flow that non-technical users can follow.

Step 01

Prepare & Validate URLs

Collects RSS feed URLs and validates which feeds are technically reachable and provide valid RSS content.

Step 02

Fetch Latest Articles

Retrieves the most recent articles from each validated RSS feed.

Step 03

Deduplicate & Store

Compares new articles against existing sheet entries and appends only non-duplicates to Google Sheets.


Example

Example workflow

A realistic scenario showing time, task, and outcome.

Scenario: A content team subscribes to 20 RSS feeds. The agent runs daily at 09:00 UTC, fetches 5 new articles, and appends them to a Google Sheet with one deduplication pass. Outcome: 5 new clean rows added, with duplicates skipped and an audit log noting the activity.

Market Research Google SheetsRSS Feed Parser AI Agent flow

Audience

Who can benefit

Roles that gain from automated RSS capture and storage.

✍️ Content Marketers

Need a centralized source of latest articles for quick reference and outreach.

💼 Content Curation Teams

Require up-to-date sources organized in a single sheet for quick sharing.

🧠 Newsroom Editors

Need to monitor multiple feeds and maintain a clean archive of articles.

Market Researchers

Benefit from deduplicated data for quick analysis and reporting.

🎯 Product Teams

Want to track industry mentions and content for competitive intel.

📋 Marketing Agencies

Need a repeatable process to populate client dashboards with fresh content.

Integrations

Core tools used and how the agent uses them inside each tool.

Google Sheets

Appends new articles and checks for duplicates to maintain a clean dataset.

RSS Feed Parser

Parses feeds to extract article metadata (title, link, pubDate) and makes it available for storage.

Applications

Best use cases

Practical scenarios where this AI agent shines.

Blog aggregators consolidate updates from many feeds into a single sheet for analysis.
Marketing teams maintain media lists by automatically collecting recent posts from industry sources.
Content desks build reference libraries with deduplicated article records.
Research teams create audit-ready RSS-based archives for reporting.
Product teams track competitors’ content for trend analysis.
Agencies deliver client dashboards updated with the latest articles.

FAQ

FAQ

Common questions about using the AI agent in real workflows.

The agent supports feeds that are publicly accessible. For authenticated feeds, credential handling can be configured within the workflow to fetch and normalize access. If a feed requires login, you can provide the necessary credentials in a secure store and reference them in the fetch step. You should ensure proper security controls when storing and using credentials. Test with a small subset of feeds before full-scale automation.

The agent compares new article titles and links against existing entries in Google Sheets. It can also consider publication dates as a secondary check. Duplicates are skipped during the append step, ensuring only unique items are stored. If you want stricter dedup rules, we can extend the comparison to include article content snippets or IDs.

The agent proceeds to the next feed and, if no feeds yield new content, logs the result and can stop based on your configuration. You can configure a threshold for maximum runs without new articles to trigger a halt. This behavior prevents unnecessary writes and keeps your sheet clean.

Yes. The agent stores configurable fields such as title, link, publication date, and summary. You can extend storage to include author, categories, or custom metadata. Any new fields require updating the Google Sheets schema and the fetch/parsing steps.

The run frequency is configurable at the workflow level. You can set daily, hourly, or event-triggered runs depending on your needs. Frequent runs provide fresher data but may require rate-limiting considerations depending on feed sources.

Yes. Each run produces a log of actions, including which feeds were checked, the number of new articles found, and any duplicates or fetch errors. Logs facilitate troubleshooting and compliance. You can export logs for audits or attach them to reports.

No. The agent is designed to be plug-and-play within a visual automation tool. Basic configuration involves providing feed URLs, a Google Sheets target, and credentials. We provide step-by-step setup instructions and can tailor the workflow to fit your exact data schema.


AI Agent for Fetching latest RSS articles and storing non-duplicates in Google Sheets

Monitors RSS feeds, validates feeds, fetches the latest articles, deduplicates against existing records, and stores clean results in Google Sheets.

Use this template → Read the docs