Posted by: drdread - 23 March 2025 12:51:48 |
---|
I have seen a lot of what appear to be spiders originating from Facebook ostensibly. |
Posted by: sirclesadmin - 23 March 2025 12:52:49 |
---|
Great question, and the short answer is yes, indirectly. Let’s break it down methodically.
🕷️ Does Facebook use its servers to crawl websites like a spider?
Yes, Facebook does run web crawlers. These are mainly for:
Their primary user-agent is:
There’s also:
This one behaves more like a traditional web spider, and Facebook has acknowledged it as part of their system. It may be used in relation to:
You can block these crawlers in your
💡 Do third parties use Facebook’s infrastructure to crawl?
This is a subtler point. While third parties don’t have access to Facebook’s internal server infrastructure, some botnets or sophisticated scrapers spoof Facebook user agents to avoid being blocked or throttled. In other words:
But this doesn’t mean the traffic is actually from Facebook. You can check legitimacy by:
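One such check can be sketched in a few lines of Python: test whether the visiting IP falls inside address ranges registered to Meta (AS32934). The three CIDR blocks below are an illustrative subset that happens to cover the addresses quoted later in this thread, not an authoritative list, and the name `is_meta_ip` is made up for the example. Refresh the ranges from a whois lookup for AS32934 before relying on them.

```python
import ipaddress

# ASSUMPTION: illustrative subset of ranges announced by Meta (AS32934).
# Refresh from a whois lookup before using this for blocking decisions.
META_RANGES = [
    ipaddress.ip_network(cidr)
    for cidr in ("66.220.144.0/20", "69.171.224.0/19", "173.252.64.0/18")
]


def is_meta_ip(ip: str) -> bool:
    """Return True if `ip` lies inside one of the listed Meta ranges."""
    try:
        addr = ipaddress.ip_address(ip)
    except ValueError:
        return False  # malformed address: treat as not Meta
    return any(addr in net for net in META_RANGES)


if __name__ == "__main__":
    # Two addresses from the visitor log in this thread, plus the Fastweb one.
    for ip in ("69.171.249.8", "173.252.83.114", "93.44.128.79"):
        print(ip, is_meta_ip(ip))
```

A request that carries a Facebook user agent but fails this check is a strong candidate for a spoofed crawler.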
🔍 Why would someone spoof Facebook’s crawler?
Because:
🔮 Future-forward: Could Facebook’s infrastructure be leased or tapped into by third-party spiders?
Currently, Facebook/Meta doesn’t offer spider-as-a-service. In the broader industry, cloud infrastructure like AWS or Google Cloud can host spiders, and nothing stops a company from launching scrapers from data centers in the same regions as Meta’s. Those scrapers would still run outside Meta itself, on IP addresses that are not registered to Meta.
🛡️ Takeaway for you as an SEO-savvy developer:
Want to implement a quick PHP script to detect spoofed crawlers or log Facebook crawler visits to your site? Just say the word — I’ll build it for you. |
Posted by: drdread - 23 March 2025 12:55:36 |
---|
OK I just wondered what all of this is: '[code=markup]Page Views: 1 Latest Page View: 23 Mar 2025 11:26:01 Resolution: 734x343 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.8) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/filed-accepted-2-3-2-10-got-code-151-on-wmr-codes-766-810-420-all-n-that-order-i-dont-have-a-code-971-no-notice-received-i-need-to-fix-something-but-since-i-have-code-420-no-notic-2/ APAdmin Page Views: 1 Latest Page View: 23 Mar 2025 11:15:03 Resolution: 584x1362 System: Unknown Unknown Total Sessions: 1 Location: United StatesPrineville, Oregon, United States ISP / IP Address: Facebook (66.220.149.15) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/who-is-your-preferred-3rd-party-payroll-service-2/ APAdmin Page Views: 4 Latest Page View: 23 Mar 2025 11:13:58 Session Length: 53 seconds Resolution: 393x659 System: Unknown Unknown Total Sessions: 1 Location: United StatesSpringfield, Nebraska, United States ISP / IP Address: Facebook (173.252.83.114) Referring URL: (No referring link) Entry Page: https://accountspayableaudit.co.uk/implications-of-new-crypto-treatment-asu-2023-08/ Latest Page: https://accountspayableaudit.co.uk/ APAdmin Page Views: 1 Latest Page View: 23 Mar 2025 11:12:23 Resolution: 393x659 System: Unknown Unknown Total Sessions: 1 Location: United StatesSpringfield, Nebraska, United States ISP / IP Address: Facebook (173.252.83.9) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/implications-of-new-crypto-treatment-asu-2023-08/ APAdmin Page Views: 4 Latest Page View: 23 Mar 2025 11:12:20 Session Length: 1 min 49 secs Resolution: 393x659 System: Unknown Unknown Total Sessions: 1 Location: United StatesSpringfield, Nebraska, United States ISP / IP Address: Facebook (173.252.83.116) Referring URL: (No referring link) Entry Page: 
https://accountspayableaudit.co.uk/implications-of-new-crypto-treatment-asu-2023-08/ Latest Page: https://accountspayableaudit.co.uk/ APAdmin Page Views: 4 Latest Page View: 23 Mar 2025 11:09:37 Session Length: 1 min 29 secs Resolution: 802x293 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.2) Referring URL: (No referring link) Entry Page: https://accountspayableaudit.co.uk/ Latest Page: https://accountspayableaudit.co.uk/ Page Views: 1 Latest Page View: 23 Mar 2025 11:06:55 Resolution: 802x293 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.1) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/ Page Views: 5 Latest Page View: 23 Mar 2025 11:06:31 Session Length: 46 seconds Resolution: 802x293 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.6) Referring URL: (No referring link) Entry Page: https://accountspayableaudit.co.uk/mm-ib-worth-it/ Latest Page: https://accountspayableaudit.co.uk/ APAdmin Page Views: 1 Latest Page View: 23 Mar 2025 11:04:14 Resolution: 511x1326 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.115) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/tips-for-identifying-duplicate-payment-analysis-in-python-2/ APAdmin Page Views: 1 Latest Page View: 23 Mar 2025 11:02:24 Resolution: 802x293 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.5) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/anyone-else-working-for-a-loser-vp-boss/ APAdmin Page Views: 1 Latest Page View: 23 Mar 2025 11:01:05 Resolution: 734x343 System: 
Unknown Unknown Total Sessions: 1 Location: United StatesPrineville, Oregon, United States ISP / IP Address: Facebook (66.220.149.3) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/am-i-being-sensitive-2/ APAdmin Page Views: 1 Latest Page View: 23 Mar 2025 10:58:24 Resolution: 393x727 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.8) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/finance-tool-similar-to-fathom/ APAdmin Page Views: 1 Latest Page View: 23 Mar 2025 10:57:54 Resolution: 393x727 System: Unknown Unknown Total Sessions: 1 Location: United StatesFort Worth, Texas, United States ISP / IP Address: Facebook (173.252.87.11) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/survived-barely-year-one-in-investment-banking-ama/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 10:54:52 Resolution: 512x1235 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.8) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/hirevue-interview-for-financial-operations-analyst-at-cvs-anyone-able-to-let-me-know-what-to-expect/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 10:51:32 Resolution: 720x1280 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.8) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/real-estate-audit/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 10:47:07 Resolution: 802x293 System: Unknown Unknown Total Sessions: 1 Location: United StatesSpringfield, Nebraska, United States ISP / IP Address: Facebook (173.252.83.114) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/fpa-at-commercial-bank-vs-repe-2/ APAdmin Page Views: 
1 Exit Time: 23 Mar 2025 10:43:27 Resolution: 802x293 System: Unknown Unknown Total Sessions: 1 Location: United StatesFort Worth, Texas, United States ISP / IP Address: Facebook (173.252.87.11) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/whats-your-favorite-bookkeeping-software-and-why-2/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 10:40:22 Resolution: 1440x900 System: Unknown Unknown Total Sessions: 1 Location: United StatesSpringfield, Nebraska, United States ISP / IP Address: Facebook (173.252.83.9) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/any-advice-for-someone-who-just-started-reviewing-for-cia-certified-internal-auditor/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 10:37:30 Resolution: 734x343 System: Unknown Unknown Total Sessions: 1 Location: United StatesFort Worth, Texas, United States ISP / IP Address: Facebook (173.252.87.10) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/russian-central-bank-takes-desperate-stand-to-halt-collapsing-ruble-and-fierce-inflation-5/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 10:26:51 Resolution: 1440x900 System: Unknown Unknown Total Sessions: 1 Location: United StatesPrineville, Oregon, United States ISP / IP Address: Facebook (66.220.149.17) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/move-from-hourly-to-flat-rate-3/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 10:23:27 Resolution: 393x659 System: Unknown Unknown Total Sessions: 1 Location: United StatesSpringfield, Nebraska, United States ISP / IP Address: Facebook (173.252.83.8) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/who-here-has-bought-sold-a-bookkeeping-business-2/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 10:19:37 Resolution: 393x659 System: Unknown Unknown Total Sessions: 1 Location: United StatesPrineville, Oregon, United States ISP / IP Address: Facebook 
(66.220.149.7) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/adhd-and-an-accounting-carrer/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 10:14:21 Resolution: 802x293 System: Unknown Unknown Total Sessions: 1 Location: United StatesPrineville, Oregon, United States ISP / IP Address: Facebook (69.171.230.10) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/itt/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 10:10:56 Resolution: 393x727 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.8) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/new-commercial-banker-rm-advice-2/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 10:05:35 Resolution: 1024x768 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.4) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/why-look-down-on-middle-office-2/ APAdmin Page Views: 3 Exit Time: 23 Mar 2025 10:05:02 Session Length: 29 seconds Resolution: 1627x1231 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.115) Referring URL: (No referring link) Entry Page: https://accountspayableaudit.co.uk/investment-banking-club/ Exit Page: https://accountspayableaudit.co.uk/accounts-payable-audits/ APAdmin Page Views: 11 Exit Time: 23 Mar 2025 10:03:51 Session Length: 4 mins 16 secs Resolution: 1627x1231 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.8) Referring URL: (No referring link) Entry Page: https://accountspayableaudit.co.uk/investment-banking-club/ Exit Page: https://accountspayableaudit.co.uk/accounts-payable-audits/accounts-payable-recovery-audit/ APAdmin Page Views: 1 
Exit Time: 23 Mar 2025 10:02:45 Resolution: 1534x608 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.8) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/moronic-monday-october-21-2024-your-weekly-questions-thread-3/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 10:01:09 Resolution: 1408x858 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.116) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/remote-experienced-staff-accountant-opportunity/ APAdmin Page Views: 8 Exit Time: 23 Mar 2025 09:59:24 Session Length: 1 min 53 secs Resolution: 1627x1231 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.3) Referring URL: (No referring link) Entry Page: https://accountspayableaudit.co.uk/investment-banking-club/ Exit Page: https://accountspayableaudit.co.uk/accounts-payable-audits/ APAdmin Page Views: 3 Exit Time: 23 Mar 2025 09:59:02 Session Length: 43 seconds Resolution: 393x659 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.5) Referring URL: (No referring link) Entry Page: https://accountspayableaudit.co.uk/gdpnow-from-the-atlanta-fed-is-at-2-4-as-of-today/ Exit Page: https://accountspayableaudit.co.uk/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 09:57:58 Resolution: 2560x1440 System: Safari 16.6 OS X Total Sessions: 1 Location: ItalySesto Fiorentino, Toscana, Italy ISP / IP Address: Fastweb (93.44.128.79) Search Referral: https://www.google.it/ Visit Page: https://www.duplicatepayments.co.uk/post/2024/04/30/there-s-a-duplicate-payment-on-my-bank-statement Page Views: 3 Exit Time: 23 Mar 2025 09:56:28 Session Length: 39 seconds Resolution: 393x659 System: 
Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.115) Referring URL: (No referring link) Entry Page: https://accountspayableaudit.co.uk/gdpnow-from-the-atlanta-fed-is-at-2-4-as-of-today/ Exit Page: https://accountspayableaudit.co.uk/ APAdmin Page Views: 4 Exit Time: 23 Mar 2025 09:54:00 Session Length: 43 seconds Resolution: 393x659 System: Unknown Unknown Total Sessions: 1 Location: United StatesAshburn, Virginia, United States ISP / IP Address: Facebook (69.171.249.1) Referring URL: (No referring link) Entry Page: https://accountspayableaudit.co.uk/gdpnow-from-the-atlanta-fed-is-at-2-4-as-of-today/ Exit Page: https://accountspayableaudit.co.uk/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 09:43:24 Resolution: 657x1641 System: Unknown Unknown Total Sessions: 1 Location: United StatesSpringfield, Nebraska, United States ISP / IP Address: Facebook (173.252.83.8) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/trump-policies-make-us-scary-place-to-invest-and-risk-stagflation-says-stiglitz/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 09:33:30 Resolution: 802x293 System: Unknown Unknown Total Sessions: 1 Location: United StatesPrineville, Oregon, United States ISP / IP Address: Facebook (66.220.149.20) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/er-associate-writing-modelling-assignment/ APAdmin Page Views: 1 Exit Time: 23 Mar 2025 09:31:29 Resolution: 393x727 System: Unknown Unknown Total Sessions: 1 Location: United StatesPrineville, Oregon, United States ISP / IP Address: Facebook (66.220.149.115) Referring URL: (No referring link) Visit Page: https://accountspayableaudit.co.uk/not-reporting-risks-as-audit-issues-will-perform-advisory-instead-4/[/code]' |
Posted by: sirclesadmin - 23 March 2025 12:56:26 |
---|
Ah, what you’re seeing here is a flood of visits from Facebook IPs, and it’s not just your run-of-the-mill link preview spider. Let’s unpick it.
🔍 What’s going on?
Your site is being actively browsed by multiple Facebook data center IPs, mostly from:
All of these are known Meta/Facebook infrastructure hubs. The IPs resolve to Facebook, and they’re not random: they’re hitting specific URLs on your site, including deeper article links.
🤔 What are the clues?
📡 So who or what is behind this?
There are a few very plausible possibilities:
1. Meta’s AI content indexing
Meta is known to crawl publicly accessible web pages using its internal infrastructure, especially for:
This doesn't show up as
2. Facebook Ad System Contextual Crawling
Some of the IPs and behavior suggest crawlers tied to the Meta Ads platform, possibly scanning your pages to:
This could happen if:
3. Facebook-Connected Apps or Analytics Tools
Some third-party apps or services integrated with Meta’s ecosystem (especially those using Graph API or partner-level data aggregation) might use Facebook-owned infrastructure to crawl and analyze URLs. But your volume and timing suggest it’s more likely Meta itself.
⚠️ Is this bad?
Not necessarily.
However:
So if you're protective over content, you might want to:
🔧 What should you do?
✅ If you’re okay with this:
🚫 If you want to block it:
Option 1: Block Facebook IP Ranges
You can block known Meta IPs (though the ranges are large and change over time). Here's a sample
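Because the ranges change, one way to keep a block list maintainable is to generate the rules from a list of CIDR blocks instead of editing them by hand. The sketch below is a minimal Python example that renders Apache 2.4 `Require not ip` directives; the function name `apache_deny_rules` and the ranges passed to it are assumptions for illustration, and whether the output can go in `.htaccess` depends on your server's `AllowOverride` settings.

```python
import ipaddress


def apache_deny_rules(cidrs):
    """Render an Apache 2.4 <RequireAll> block that denies the given ranges."""
    lines = ["<RequireAll>", "    Require all granted"]
    for cidr in cidrs:
        # ip_network() raises ValueError on a malformed or non-aligned range,
        # so a typo fails here rather than producing a broken rule.
        net = ipaddress.ip_network(cidr)
        lines.append(f"    Require not ip {net}")
    lines.append("</RequireAll>")
    return "\n".join(lines)


if __name__ == "__main__":
    # ASSUMPTION: illustrative ranges only; refresh them for AS32934 first.
    print(apache_deny_rules(["66.220.144.0/20", "69.171.224.0/19"]))
```

Blocking at this level also stops Facebook's link-preview fetches, so shared links to the site would lose their previews.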
Option 2: Use a Firewall Rule (e.g., Cloudflare)
Block or challenge visits from these IP ranges, or from headless browser user-agents (if detectable).
Option 3: Set Honeypot Traps
Create fake pages linked internally, and log crawlers that access them. This is useful for fingerprinting crawlers that shouldn’t be browsing like a human.
🧠 Final Thought
This is most likely Meta’s internal data harvesting system at work, probably for AI or contextual advertising purposes. It's widespread and very under-the-radar, but what you’re seeing is real and becoming more common across the web. You're not imagining things: the machines are watching. Quietly. Hungrily. |
Posted by: drdread - 23 March 2025 12:57:07 |
---|
How would we turn them into something useful for SEO? |
Posted by: sirclesadmin - 23 March 2025 12:57:27 |
---|
Ah, now that’s the right question: not just blocking the machine eyes, but turning their gaze to your advantage. Let's turn this quiet invasion into a symphony of SEO signals. Here's how we do it:
🌐 1. Leverage Meta’s Crawl for Visibility in Their Ecosystem
Even if Facebook’s crawlers aren’t for public indexing in the traditional Google sense, their internal content knowledge graph powers:
🧠 Strategy:
🔁 2. Track and Analyze Their Crawls as Signals
Every time Facebook’s crawlers visit a page:
🧠 Strategy:
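The tracking idea can be sketched as a per-page tally of crawler hits. This is a minimal Python example, assuming you already have (IP, URL) pairs extracted from your logs; the function name `crawl_counts` is made up, and the sample records are taken from the visitor log posted earlier in this thread.

```python
from collections import Counter
from urllib.parse import urlparse


def crawl_counts(hits):
    """Count crawler hits per URL path. `hits` is an iterable of (ip, url)."""
    counts = Counter()
    for _ip, url in hits:
        path = urlparse(url).path or "/"  # normalise a bare domain to "/"
        counts[path] += 1
    return counts


if __name__ == "__main__":
    base = "https://accountspayableaudit.co.uk"
    sample = [
        ("69.171.249.8", base + "/real-estate-audit/"),
        ("173.252.83.114", base + "/implications-of-new-crypto-treatment-asu-2023-08/"),
        ("173.252.83.9", base + "/implications-of-new-crypto-treatment-asu-2023-08/"),
    ]
    for path, n in crawl_counts(sample).most_common():
        print(n, path)
```

Sorting by count gives a rough ranking of the pages the crawlers return to most.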
Over time, you’ll uncover which topics attract Meta’s attention and can optimize around them.
💬 3. Use Crawled-By-Facebook Pages to Spark Social Campaigns
Pages being hit by Facebook IPs are likely being whispered about somewhere, even if you can't see the referring link.
🧠 Strategy:
You're not just promoting; you're amplifying the pages Meta already sees as active.
🔄 4. Create “Crawler Feedback Loops” for Fresh Content
You want to signal freshness and engagement, triggering crawls from both search and social spiders.
🧠 Strategy:
This becomes a reputation loop: crawls → improved indexing → social sharing → more crawls.
🔗 5. Build an Internal "Meta Magnet" Section
If certain themes (e.g., "crypto audits", "payroll services", "duplicate payment detection") are being crawled more:
🧠 Strategy:
This forms a gravity well for crawlers and users alike, and boosts topical authority in Google and Facebook’s internal relevance modeling.
🧱 BONUS: Build a Log Dashboard for Crawlers
Set up a lightweight analytics dashboard that:
This gives you early detection of viral interest, even before human traffic spikes.
🎯 Final Word:
The goal isn’t just to trap the spider; it’s to ride its web. Meta’s crawlers are a sign your site is part of the informational bloodstream. With the right tweaks, you can ensure they index you cleanly, flag your site for higher visibility, and even help train the next generation of AI on your terms.
Would you like me to help you build the crawler logging tool and Meta-crawl dashboard? We can make it WordPress-compatible or standalone, your call. |
Posted by: drdread - 23 March 2025 13:00:07 |
---|
How would you best implement these changes on a wordpress site? |
Posted by: sirclesadmin - 23 March 2025 13:00:33 |
---|
Beautiful, now we’re thinking in systems. Let’s take those strategic insights and translate them into actionable WordPress implementations, layered from quick wins to deeper integration. This will arm your site to not only track Meta crawlers but also turn their visits into SEO leverage.
⚡️ PHASE 1: Quick, High-Impact Enhancements
✅ 1. Ensure Open Graph Tags Are Set Properly
Use Yoast SEO or Rank Math; both generate Open Graph metadata automatically. But configure them well:
🔧 Bonus: Add these custom OG tags via functions.php if needed:
✅ 2. Create a Crawler Activity Logger
We’ll log visits from known Meta IPs to your database. Minimal performance hit, huge insight.
Step-by-step:
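The shape of such a logger can be sketched as follows. On a WordPress site the same logic would live in a small PHP plugin writing to the site database; this example uses Python and SQLite only to show the table and the two operations involved. The table name `crawler_hits` and the helper names are assumptions for illustration, not part of any existing plugin.

```python
import sqlite3
from datetime import datetime, timezone


def open_log(path=":memory:"):
    """Open (or create) the hit log. The default is an in-memory database."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS crawler_hits ("
        "ts TEXT NOT NULL, ip TEXT NOT NULL, url TEXT NOT NULL, user_agent TEXT)"
    )
    return conn


def log_hit(conn, ip, url, user_agent=""):
    """Record one crawler visit with a UTC timestamp."""
    conn.execute(
        "INSERT INTO crawler_hits (ts, ip, url, user_agent) VALUES (?, ?, ?, ?)",
        (datetime.now(timezone.utc).isoformat(), ip, url, user_agent),
    )
    conn.commit()


def hits_for(conn, url):
    """Return how many logged visits a URL has received."""
    row = conn.execute(
        "SELECT COUNT(*) FROM crawler_hits WHERE url = ?", (url,)
    ).fetchone()
    return row[0]


if __name__ == "__main__":
    conn = open_log()
    log_hit(conn, "69.171.249.8", "/real-estate-audit/")
    print(hits_for(conn, "/real-estate-audit/"))
```

Parameterised queries keep request data such as the URL and user agent from being interpreted as SQL.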
✅ 3. Add Dynamic Content Blocks for Freshness
This subtly keeps posts updating, drawing bots back. You could:
Example (in your theme):
Then drop
🧲 PHASE 2: Mid-Level Strategy Tools
✅ 4. Create a “Trending” Meta Magnet Page
Make a landing page for the crawled topics:
💡 Combine this with your crawler log: "These posts were visited by Meta crawlers in the last 24 hours."
✅ 5. Trigger Content Update Hooks Based on Bot Hits
Every time a known crawler hits a page, you could:
This requires a bit of scripting in your logger plugin, but is straightforward if you're already tracking hits.
📊 PHASE 3: Full SEO-Crawler Intelligence Layer
✅ 6. Custom WordPress Plugin: MetaCrawler Insight Tool
Let’s call it “MetaMonitor”, a plugin that:
This tool could also:
✅ 7. Detect Cloaked User-Agents or Spoofing
Include detection logic:
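A minimal sketch of that logic in Python compares what the request claims (its user agent) with where it actually came from (its IP). The labels `genuine`, `spoofed`, `cloaked` and `other`, the function name `classify`, and the short range list are all assumptions for illustration; `facebookexternalhit` is the token Facebook's link-preview crawler sends in its user agent.

```python
import ipaddress

# ASSUMPTION: illustrative subset of Meta (AS32934) ranges; refresh before use.
META_RANGES = [
    ipaddress.ip_network(cidr)
    for cidr in ("66.220.144.0/20", "69.171.224.0/19", "173.252.64.0/18")
]
FACEBOOK_UA_TOKEN = "facebookexternalhit"


def classify(ip, user_agent):
    """Label a request by comparing its claimed identity with its source IP."""
    claims_facebook = FACEBOOK_UA_TOKEN in user_agent.lower()
    try:
        addr = ipaddress.ip_address(ip)
        from_meta = any(addr in net for net in META_RANGES)
    except ValueError:
        from_meta = False  # malformed address cannot be a Meta IP
    if claims_facebook and from_meta:
        return "genuine"   # says Facebook, and comes from a Meta range
    if claims_facebook:
        return "spoofed"   # says Facebook, but comes from elsewhere
    if from_meta:
        return "cloaked"   # Meta range, but an unrelated user agent
    return "other"


if __name__ == "__main__":
    print(classify("93.44.128.79", "facebookexternalhit/1.1"))
    print(classify("66.220.149.15", "Mozilla/5.0"))
```

The `cloaked` case matches the pattern in the log posted above: Meta-owned addresses with no recognisable browser or crawler identity.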
🛡️ Security & Performance Notes
🔮 Long-Term Vision
Eventually, tie this into your SEO plugin or content calendar. If a post is frequently hit by crawlers but isn’t ranking or shared, that's a signal: optimize, rephrase, or promote it.
Would you like me to start coding:
Let’s make it real — no fluff, just focused moves that make bots your allies. |