Skip to content

如何在线抓取推文:使用 Apify 平台快速获取 Twitter 数据,无需代码

Published: at 12:00 PMSuggest Changes

引言

想要获取 Twitter 数据但不想写代码?本文将介绍如何使用 Apify 平台快速抓取推文数据。通过简单的在线操作,您就能获取需要的 Twitter 数据,无需编程经验。

Table of contents

Open Table of contents

什么是 Apify?

Apify 是一个强大的网络抓取和自动化平台,它提供了多种预构建的工具(Actor)来抓取各类网站数据。对于 Twitter 数据抓取,Apify 提供了多个专门的 Actor:

使用步骤

1. 注册 Apify 账号

  1. 访问 Apify 官网
  2. 点击右上角的 “Sign up” 按钮
  3. 可以使用 Google 账号或邮箱注册
  4. 新用户可获得免费积分用于测试

2. 选择合适的 Actor

根据您的需求选择合适的 Actor:

3. 配置抓取参数

以 Tweet Fast Scraper 为例,您需要设置以下参数:

{
  "username": "elonmusk",
  "startTime": "2024-12-07_00:00:00_UTC",
  "endTime": "2024-12-08_23:59:59_UTC",
  "sortBy": "Latest",
  "maxItems": 100,
  "minRetweets": 0,
  "minLikes": 0,
  "minReplies": 0,
  "onlyVerifiedUsers": false,
  "onlyBuleVerifiedUsers": false
}

参数说明:

4. 运行 Actor

  1. 点击 “Start” 按钮开始运行
  2. 等待任务完成(通常只需几秒)
  3. 在 “Last run” 标签页查看结果

5. 导出数据

Apify 支持多种数据导出格式:

数据示例

以下是抓取到的推文数据示例:

{
        "type": "tweet",
        "id": 1843718588299129000,
        "viewCount": "67402157",
        "url": "https://x.com/elonmusk/status/1843718588299129191",
        "twitterUrl": "https://twitter.com/elonmusk/status/1843718588299129191",
        "text": "Encourage your friends in swing states to sign the petition in support of free speech & right to bear arms and earn $47 for doing something you already believe in!",
        "isQuote": true,
        "retweetCount": 34150,
        "replyCount": 10876,
        "likeCount": 186465,
        "quoteCount": 1498,
        "createdAt": "Tue Oct 08 18:22:20 +0000 2024",
        "lang": "en",
        "quoteId": "1843714260725367089",
        "bookmarkCount": 4137,
        "isReply": false,
        "entities": {
            "hashtags": [],
            "symbols": [],
            "timestamps": [],
            "urls": [],
            "user_mentions": []
        },
        "source": "Twitter for iPhone",
        "author": {
            "type": "user",
            "username": "elonmusk",
            "url": "https://x.com/elonmusk",
            "twitterUrl": "https://x.com/elonmusk",
            "id": "44196397",
            "name": "Elon Musk",
            "isVerified": false,
            "isBlueVerified": true,
            "verifiedType": null,
            "profilePicture": "https://pbs.twimg.com/profile_images/1858316737780781056/kPL61o0F_normal.jpg",
            "coverPicture": "https://pbs.twimg.com/profile_banners/44196397/1726163678",
            "description": "The people voted for major government reform",
            "location": "",
            "followers": 206745525,
            "following": 866,
            "protected": false,
            "status": null,
            "canDm": false,
            "canMediaTag": false,
            "createdAt": "Tue Jun 02 20:12:29 +0000 2009",
            "advertiserAccountType": null,
            "analyticsType": null,
            "entities": {
                "description": {
                    "urls": []
                }
            },
            "fastFollowersCount": 0,
            "favouritesCount": 98546,
            "geoEnabled": false,
            "hasCustomTimelines": true,
            "hasExtendedProfile": false,
            "isTranslator": false,
            "mediaCount": 2917,
            "profileBackgroundColor": "",
            "statusesCount": 61789,
            "translatorTypeEnum": "none",
            "withheldInCountries": [],
            "affiliatesHighlightedLabel": {
                "label": {
                    "url": {
                        "url": "https://twitter.com/X",
                        "urlType": "DeepLink"
                    },
                    "badge": {
                        "url": "https://pbs.twimg.com/profile_images/1683899100922511378/5lY42eHs_bigger.jpg"
                    },
                    "description": "X",
                    "userLabelType": "BusinessLabel",
                    "userLabelDisplayType": "Badge"
                }
            }
        },
        "quote": {
            "type": "tweet",
            "id": "1843714260725367089",
            "text": "The first $47 payments were hand-delivered in Philly last night as a surprise to early signers!\n\nSign the petition and refer your friends and receive $47 for each registered swing state voter you refer that signs.\n\nhttps://t.co/PgHAgrmrv8 https://t.co/F8xd9ZCwI0",
            "retweetCount": 2798,
            "replyCount": 1695,
            "likeCount": 11387,
            "quoteCount": 1061,
            "createdAt": "Tue Oct 08 18:05:08 +0000 2024",
            "lang": "en",
            "bookmarkCount": 1003,
            "author": {
                "type": "user",
                "username": "america",
                "url": "https://x.com/america",
                "twitterUrl": "https://x.com/america",
                "id": "1841988608825466880",
                "name": "America",
                "isVerified": false,
                "isBlueVerified": true,
                "verifiedType": null,
                "profilePicture": "https://pbs.twimg.com/profile_images/1842003465448398848/U3EZmdUz_normal.jpg",
                "coverPicture": "https://pbs.twimg.com/profile_banners/1841988608825466880/1728002939",
                "description": "PAC Founded by @ElonMusk to support candidates who champion Secure Borders, Sensible Spending, Safe Cities, Fair Justice System, Free Speech and Self-Protection",
                "location": "USA",
                "followers": 883619,
                "following": 0,
                "protected": false,
                "status": null,
                "canDm": false,
                "canMediaTag": true,
                "createdAt": "Thu Oct 03 23:48:27 +0000 2024",
                "advertiserAccountType": null,
                "analyticsType": null,
                "entities": {
                    "description": {
                        "urls": []
                    },
                    "url": {
                        "urls": [
                            {
                                "display_url": "theamericapac.org",
                                "expanded_url": "https://theamericapac.org",
                                "url": "https://t.co/PdGFm7qcgm",
                                "indices": [
                                    0,
                                    23
                                ]
                            }
                        ]
                    }
                },
                "fastFollowersCount": 0,
                "favouritesCount": 115,
                "geoEnabled": false,
                "hasCustomTimelines": false,
                "hasExtendedProfile": false,
                "isTranslator": false,
                "mediaCount": 544,
                "profileBackgroundColor": "",
                "statusesCount": 917,
                "translatorTypeEnum": "none",
                "withheldInCountries": [],
                "affiliatesHighlightedLabel": {}
            }
        }
    }

高级功能

1. 定时任务

Apify 支持设置定时任务,可以:

2. Webhook 集成

可以设置 Webhook 在任务完成时:

3. API 调用

提供 API 接口实现:

使用技巧

  1. 优化抓取效率

    • 设置合适的时间范围
    • 使用精确的筛选条件
    • 避免抓取过大数据量
  2. 数据质量控制

    • 验证数据完整性
    • 检查数据格式
    • 注意时区设置
  3. 成本控制

    • 合理使用免费积分
    • 选择适当的计划
    • 优化任务配置

常见问题

Q: 需要编程知识吗?

A: 不需要,Apify 提供了友好的图形界面,只需简单配置即可使用。

Q: 数据更新频率如何?

A: 可以自定义设置,支持分钟级到天级的更新频率。

Q: 数据是否实时?

A: 是的,Apify 可以获取最新的推文数据,延迟通常在分钟级别。

Q: 支持哪些语言的推文?

A: 支持所有 Twitter 支持的语言,可以通过语言参数筛选。

总结

使用 Apify 平台抓取 Twitter 数据是一个简单高效的解决方案。它不仅降低了技术门槛,还提供了强大的功能和灵活的配置选项。无论是个人研究还是商业分析,都能满足不同场景的需求。

扩展资源

记住要遵守 Twitter 的使用条款和 Apify 的服务条款。如果您需要更多支持,可以加入我们的 讨论组


Next Post
How to configure AstroPaper theme