Vercel · Agent Browser · browser automation

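Vercel's open-source tool lets AI "click buttons and post on Weibo": browser automation is shifting from a programmers-only skill to something ordinary people can use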

What This Is

Until now, AI could read a webpage but not act on one. It could parse HTML, but it couldn't click "Submit," type into a search box, or like a post on your behalf. That gap has kept a large class of seemingly simple tasks firmly out of AI's reach.

Vercel Labs, the experimental division of U.S. cloud platform Vercel, has open-sourced a command-line tool called Agent Browser, built specifically to let AI control a browser. Once it is installed, you give the AI a plain-language instruction such as "Go to Weibo and like the first CCTV News post," and it opens a browser, finds the button, and does it. No code required on your end.

One detail makes this especially practical: Agent Browser can connect directly to your running Chrome instance and inherit your existing login sessions. You don't re-enter passwords. If you're already logged into a platform, the AI can act as you, immediately.

How the Industry Sees It

Proponents argue that tools like this mark a genuine inflection point: AI automation is migrating from "requires custom scripts" to "requires a sentence." Traditional browser automation forced developers to hard-code the exact position of every button in a lengthy script. Agent Browser's approach lets the AI interpret the page's structure itself and decide how to interact with it. That removes a meaningful amount of setup friction.
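The difference between position-based scripts and structure-based interaction can be sketched in a few lines of Python. This is an illustrative toy, not Agent Browser's actual implementation: the `Element` type, the `find_by_role_and_name` helper, and the sample page data are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Element:
    """A simplified page element, roughly as a structural snapshot might describe it (hypothetical)."""
    ref: str    # opaque handle an agent would use to act on the element
    role: str   # e.g. "button", "textbox"
    name: str   # visible label or accessible name


def find_by_role_and_name(snapshot: list[Element], role: str, name: str) -> Optional[Element]:
    """Return the first element whose role matches and whose name contains `name` (case-insensitive)."""
    wanted = name.lower()
    for element in snapshot:
        if element.role == role and wanted in element.name.lower():
            return element
    return None


# Two layouts of the same hypothetical page: the Like button moved, but its role and label did not.
layout_before = [
    Element("e1", "textbox", "Search"),
    Element("e2", "button", "Like"),
]
layout_after = [
    Element("e1", "button", "Share"),
    Element("e2", "textbox", "Search"),
    Element("e3", "button", "Like"),
]

# A position-based script hard-codes "the second element" and picks the wrong one after the redesign.
hard_coded_index = 1
position_based_before = layout_before[hard_coded_index]
position_based_after = layout_after[hard_coded_index]

# A structure-based lookup keeps finding the intended control in both layouts.
structure_based_before = find_by_role_and_name(layout_before, "button", "like")
structure_based_after = find_by_role_and_name(layout_after, "button", "like")
```

In this toy, the hard-coded index lands on the search box once the layout changes, while the role-and-label lookup still resolves to the Like button. A real agent has to cope with ambiguous labels and dynamic content, which this sketch ignores.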

The objections and risks deserve equal attention. The first is security: granting AI access to your authenticated browser session means that if the AI misreads an instruction, or is fed a malicious one, it could post, delete content, or trigger a payment without your awareness. Second, "just say what you want" still requires completing installation and authorization steps that will stop most non-technical users before they get started. The gap between the marketing pitch and the actual setup experience remains real. Third, major platforms (Weibo, Xiaohongshu, X, and others) run active anti-bot and anti-scraping systems; anyone using these tools at scale faces meaningful account-suspension risk.
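One way to narrow the first risk is to place a policy check between the action an AI proposes and the browser that would carry it out. The following is a minimal sketch under stated assumptions: the action names, the host allowlist, and the `review` function are hypothetical and are not part of Agent Browser.

```python
from dataclasses import dataclass
from urllib.parse import urlparse

# Hypothetical policy values, chosen only for illustration.
ALLOWED_HOSTS = {"intranet.example.com"}
SENSITIVE_ACTIONS = {"post", "delete", "pay"}


@dataclass(frozen=True)
class ProposedAction:
    kind: str  # e.g. "click", "type", "post", "delete", "pay"
    url: str   # page the action would run on


def review(action: ProposedAction, user_confirmed: bool = False) -> str:
    """Return "deny", "ask", or "allow" for an action the AI wants to take."""
    host = urlparse(action.url).hostname or ""
    if host not in ALLOWED_HOSTS:
        return "deny"   # outside the permitted sites entirely
    if action.kind in SENSITIVE_ACTIONS and not user_confirmed:
        return "ask"    # irreversible or costly: require a human yes first
    return "allow"
```

For example, `review(ProposedAction("pay", "https://intranet.example.com/checkout"))` returns `"ask"` until the user confirms, and any action on a host outside the allowlist is denied. A check like this limits damage from misread or malicious instructions, but it does not detect them.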

What This Means for Regular People

For enterprise IT: The most immediate use cases are in internal process automation, such as scheduled screenshots for archiving, cross-system data entry, and bulk UI testing. IT teams should evaluate which repetitive manual workflows this can absorb, while establishing clear permission boundaries to prevent the AI from touching sensitive systems it shouldn't.

For individual professionals: Roles that involve high volumes of repetitive web tasks (data collection, content publishing, competitor monitoring) have a new efficiency option here. Our read: in the near term, this is more "one more tool for people who learn it" than "an existential threat to people who don't."

For the consumer market: As browser automation becomes easier to deploy, consumers will likely feel the effects indirectly, through merchants using it to manage reviews and inventory at scale, or through platforms deploying it to detect anomalous behavior. The cat-and-mouse dynamic that follows is worth watching more closely than the tool itself.

Source: juejin.cn
Vercel · Agent Browser · browser automation

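Vercel's open-source tool lets AI "click buttons and post on Weibo": browser automation is shifting from a programmers-only skill to something ordinary people can use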

What This Is

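In the past, AI could "read" a webpage but not "act" on it: it could analyze a piece of HTML, but it could not click a "Submit" button for you, type into a search box, or like a Weibo post on your behalf. That wall left a large number of seemingly simple tasks beyond AI's reach.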

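Vercel Labs (the experimental division of U.S. cloud platform Vercel) recently open-sourced a command-line tool called Agent Browser (a CLI, that is, a tool you control by typing text commands), designed specifically for letting AI operate a browser. After installing it, you only need to tell the AI in natural language "Go to Weibo and like the first CCTV News post," and it will actually open the browser, find the button, and complete the action.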

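The tool has one especially practical detail: it can connect directly to the Chrome browser you are already using and reuse your existing login state, with no need to re-enter passwords. In other words, as long as you are already logged in to a platform, the AI can act there under your identity.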

How the Industry Sees It

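Supporters argue that the arrival of such tools marks a migration of AI automation (letting AI carry out a series of consecutive actions in place of a person) from "usable only by writing code" to "usable by speaking." Browser automation used to require developers to hand-write complex scripts that specified the exact position of every button. Agent Browser's approach is to let the AI "understand" the page structure itself and then decide how to act, which lowers the barrier to use considerably.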

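But the objections and risks deserve equal attention. First is the security hazard: allowing AI to operate the browser under your logged-in identity means that if the AI misunderstands, or is disrupted by a malicious instruction, it could post, delete content, or even trigger a payment without your knowledge. Second, tools like this still require some technical background to configure; the advertised "done in one sentence" presumes you have already completed several installation and authorization steps, and for non-technical users that barrier remains. In addition, major platforms (Weibo, Xiaohongshu, X, and others) have explicit anti-scraping and anti-bot mechanisms, so use at scale carries a risk of account suspension.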

What This Means for Regular People

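For enterprise IT: The most direct applications for tools like this are in internal process automation, such as periodic screenshots for archiving, cross-system data entry, and batch testing of web functionality. IT departments should assess which repetitive manual operations it can replace, while also establishing clear permission boundaries to keep the AI from overstepping into sensitive systems.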

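For individual professionals: For roles that handle large volumes of repetitive web operations every day (data collection, content publishing, competitor monitoring), tools like this offer a new route to efficiency. Our judgment: in the short term it looks more like "one more tool for those who can use it" than "a threat to those who cannot."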

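For the consumer market: Once browser automation tools become easy enough to use, the first thing consumers notice may be a change in service quality, whether from merchants using them to manage reviews and inventory in bulk or from platforms using them to detect anomalous behavior. This cat-and-mouse game will be more worth watching than the tool itself.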

Source: juejin.cn