557 Commits

Author SHA1 Message Date
Leyang
b76211bd5d
feat: android playground (#542)
* refactor: android api

* refactor: enhance Android agent to accept options for device connection

* fix: type error

* fix: click after clearInput

* fix: click before clearInput

* feat: android playground

* feat: support npx package name

* feat: android playground joint

* fix: git ignore conflicts

* feat: ensure adb server is running before initializing adb client

* fix: deps consistency

* ci: add android playground

* feat: integrate shared constants and improve server configuration in android playground

* feat: android playground style

* feat: style opt

* feat: add @rsbuild/plugin-svgr dependency and improve URI handling in adb

* feat: remove unused water flow scripts and update comments to English

* feat: download report file

* feat: standalone android playground

* feat: use dynamic import

* feat: migrate CSS to LESS and remove unused styles in chrome extension and report

* feat: enhance Android playground with ScrcpyPlayer ref integration and device management improvements

* feat: optimize styles and layout in Android playground and visualizer components

* chore: add bin back

* chore: update build script to exclude documentation generation

* feat: add not ready message to PlaygroundResult for improved user guidance

* feat: add error handling for screenshot capture in Android page

* docs: update readme

* feat: add PNG validation for screenshot buffer in Android page

* feat: enhance UI components with improved styling and tooltips in ScrcpyPlayer and PromptInput

* docs: update uri parameter description in integrate-with-android documentation and improve uri handling in launch function

* style: update primary color to #2B83FF across multiple components and adjust margin in App.less

* refactor: replace userConfig with globalConfig for environment configuration management and update related functions

* feat: integrate server validation logic in App, AdbDevice, and ScrcpyPlayer components for improved connection handling

* style: enhance player component layout with overflow handling and margin adjustments

* style: refine player component layout with flex adjustments and improved spacing

* feat: add midscene model name display and improve layout in EnvConfig component

* feat: integrate ShinyText component for enhanced loading progress display in PlaygroundResult

* test: add test for isValidPNGImageBuffer

* style: remove background color from App.less and adjust AI config override behavior in env.ts

---------

Co-authored-by: yutao <yutao.tao@bytedance.com>
2025-04-17 17:44:11 +08:00
yuyutaotao
a1b5a54d89
fix(report): do not call mkdir in browser (#577)
* fix: log dir

* fix: log dir

* fix: gitignore config
2025-04-17 15:09:59 +08:00
yuyutaotao
824be26c85
fix: use tmpdir as a fallback for log file (#575)
---------

Co-authored-by: zhouxiao.shaw <zhouxiao.shaw@bytedance.com>
2025-04-17 10:54:19 +08:00
yuyutaotao
dd233ef25b
chore: adapting new model (#570)
* chore: adapting new model

* feat: adapt bbox format

* chore: adapt ui-tars
---------

Co-authored-by: zhouxiao.shaw <zhouxiao.shaw@bytedance.com>
2025-04-17 10:53:19 +08:00
yuyutaotao
445381efef
feat(core): show warning for oversized image (#564)
---------

Co-authored-by: zhouxiao.shaw <zhouxiao.shaw@bytedance.com>
2025-04-17 10:51:58 +08:00
yuyutaotao
c777c139e4
fix: puppeteer export (#569) 2025-04-17 10:50:16 +08:00
Leyang
80205b4625
feat: add uitars as vl mode (#565)
* feat: add uitars as vl mode

* feat: add uitars as vl mode

* refactor: simplify model description logic in insight utilities

* chore: warn for conflict configs

---------

Co-authored-by: yutao <yutao.tao@bytedance.com>
2025-04-16 11:12:56 +08:00
Leyang
258dba3011
fix: dry run (#566) 2025-04-15 17:46:28 +08:00
Leyang
3793e91899
docs: add android playground docs (#563)
* docs: add android playground docs

* chore: update pic

* docs: distinguish android and web

* chore: update docs

* chore: update docs

---------

Co-authored-by: yutao <yutao.tao@bytedance.com>
2025-04-15 11:31:12 +08:00
yuyutaotao
edf48da198
docs: update caching docs (#559)
* docs: update caching docs

* docs: update caching docs

* docs: update docs

* chore: update docs
2025-04-14 14:50:48 +08:00
yuyutaotao
027283aaac
feat: disable http warning in yaml mode (#556) 2025-04-11 10:04:05 +08:00
Leyang
0ca9fda7ae
feat: android yaml support (#551)
* feat: android yaml support

* feat: enhance ScriptPlayer to support web and android environments

* chore: update error message

* refactor: unify environment interfaces for YAML scripts

---------

Co-authored-by: yutao <yutao.tao@bytedance.com>
2025-04-11 09:36:41 +08:00
Leyang
570c2d7294
fix: rethrow error with android faq when adb error occurred (#554) 2025-04-10 20:51:56 +08:00
github-actions[bot]
1bd873b360 release: v0.14.3 2025-04-10 09:10:39 +00:00
yuyutaotao
9be848b651
feat: allow insecure https in yaml (#553) 2025-04-10 16:57:46 +08:00
yuyutaotao
66a5d16d37
chore: update evaluation fixture (#552) 2025-04-10 16:54:21 +08:00
Leyang
fbaa314520
feat(android): implement custom method to retrieve screen size and simplify size handling (#544)
* feat(android): implement custom method to retrieve screen size and simplify size handling

* fix: use debugPage

* feat: print device info
2025-04-09 10:54:39 +08:00
Leyang
0bbe416151
fix(android): verify vl-model mode in AndroidAgent constructor (#547)
* fix(android): verify vl-model mode in AndroidAgent constructor

* fix: error message

---------

Co-authored-by: yutao <yutao.tao@bytedance.com>
2025-04-09 10:48:53 +08:00
github-actions[bot]
4d77207c0f release: v0.14.2 2025-04-09 02:30:47 +00:00
yuyutaotao
732f605144
fix(llm): coords offset in vl locator (#545) 2025-04-08 17:56:15 +08:00
Zhou Xiao
8582f86793
fix(playwright): Prevent the test from ending due to consistent timeout of screenshots (#541) 2025-04-07 16:02:55 +08:00
lsustc
bd0bf46e9a
fix click event should move the mouse to the click area first (#535)
* fix click event should move the mouse to the click area first

* fix reusing the existing function
2025-04-06 21:53:21 +08:00
Zhou Xiao
2c3c540cb1
fix(playwright): make test.describe work correctly with ai fixtures (#537)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-04-04 16:14:04 +08:00
github-actions[bot]
92e4d7ed61 release: v0.14.1 2025-04-03 08:40:18 +00:00
yuyutaotao
c6cd10ebb2
fix: filename (#534) 2025-04-03 16:37:00 +08:00
yuyutaotao
767ffcd873
docs: blog for instant action and deep think (#527) 2025-04-03 14:26:30 +08:00
github-actions[bot]
00dc2f9f50 release: v0.14.0 2025-04-03 06:01:06 +00:00
yuyutaotao
0171b5e2de
fix: ci testing (#529)
* fix: ci

* fix: ci
2025-04-03 10:51:50 +08:00
Leyang
bc645c3d0c
refactor: android api (#521)
* refactor: android api

* refactor: update the Android agent and device connection logic

* refactor: enhance Android agent to accept options for device connection

* fix: type error

* feat: use first device when no deviceId pass

* fix: import from agent.ts

* fix: click after clearInput

* fix: click before clearInput
2025-04-03 10:28:40 +08:00
yuyutaotao
bcdf90b997
feat: add context for aiAction (#528) 2025-04-02 20:34:23 +08:00
yuyutaotao
0c0675ba4b
feat: update deepThink interface (#517)
* feat: update deepThink interface

* fix: CI

* chore: update error message

* feat: update search area config

* fix: deepthink ui

---------

Co-authored-by: Zhou Xiao <zhouxiao.shaw@bytedance.com>
2025-04-02 19:26:56 +08:00
Zhou Xiao
814f99ca48
refactor(visualizer): optimize extension logic (#525) 2025-04-02 19:12:52 +08:00
Zhou Xiao
5ef9202c79
refactor(report): optimize visualizer common logic (#519)
* chore: delete useless visualizer logic

* chore: delete useless code

* chore: delete useless code

* chore: delete useless code

* chore: delete useless common logic

* chore: merge main branch

* chore: update lock file
2025-04-01 17:25:40 +08:00
yuyutaotao
cf4acf30d9
feat: print stats in debug (#515)
Co-authored-by: Zhou Xiao <zhouxiao.shaw@bytedance.com>
2025-04-01 17:20:37 +08:00
Leyang
882a2db9ee
fix: force screenshot when the app doesn't allow it (#518)
* fix: force screenshot when the app doesn't allow it

* fix: support start point in scroll method

* refactor: rename to ensureYadb
2025-04-01 17:05:47 +08:00
Zhou Xiao
ca28ae0b29
fix(report): resolve report script error (#516) 2025-04-01 15:03:42 +08:00
yuyutaotao
adc3fd9e0d
feat: use aiTap(prompt, opt) style interface (#512) 2025-04-01 10:41:31 +08:00
Zhou Xiao
d1b232586b
chore: delete useless visualizer logic (#514) 2025-03-31 19:51:43 +08:00
Zhou Xiao
6468bb0206
refactor(logger): use log file output instead of command line output in Node (#509)
* chore: add element detail info

* chore: add element detail info

* chore: add element detail info

* chore: optimize logger logic

* chore: optimize logger logic
2025-03-31 19:22:39 +08:00
Zhou Xiao
16d6efa1f0
refactor(report): make the report page an app type to improve the development experience (#513) 2025-03-31 18:46:35 +08:00
Leyang
bf9b4e06e7
feat: migrate android code to @midscene/android (#505) 2025-03-26 18:59:50 +08:00
Zhou Xiao
375a5b0152
chore: support new ui-tars model json format (#506) 2025-03-26 18:57:24 +08:00
Leyang
1f3fc5b580
chore: not run android tests in ci && update tests for android setting case (#502)
* chore: not run android tests in ci && update tests for android setting case

* ci: add lint for json
2025-03-26 11:39:39 +08:00
github-actions[bot]
00de5d6478 release: v0.13.1 2025-03-25 14:53:44 +00:00
Leyang
934a1e2b5d
use adb instead of appium (#483)
* feat: enable search area for locate

* fix: update evaluation

* fix: locator

* feat: show searchArea in report

* chore: add yaml support for aiTap

* feat: use adb instead of appium

* feat: Adds debugging information and reconstructs input text capabilities

* feat: refactoring Android related functions and adding android modules

* feat: update the image scaling algorithm, adjust the Android page class to support device scaling, and remove test files that are no longer needed

* feat: adjust the Android page class to support device scaling, and remove test files that are no longer needed

* feat: use appium-adb instead of bare command

* fix: update entry for @midscene/android

* feat: optimize the screenshot processing logic, add a backup mechanism when screenshots fail, and update test cases to accommodate new features

* fix: rethrow error

* feat: add Android debug configuration options and update documentation

* chore: fix code style in #483 (#492)

* fix: remove try for error handle by outside

---------

Co-authored-by: yutao <yutao.tao@bytedance.com>
Co-authored-by: linyibing <linyibing@bytedance.com>
Co-authored-by: yuyutaotao <167746126+yuyutaotao@users.noreply.github.com>
2025-03-25 22:45:05 +08:00
yuyutaotao
010ee92d62
fix: playwright integration (#497)
* fix: playwright integration

* chore: fix types

---------

Co-authored-by: zhouxiao.shaw <zhouxiao.shaw@bytedance.com>
2025-03-25 17:22:25 +08:00
yuyutaotao
e94c31f228
fix: ai test (#494)
* fix: ai test

* fix: planning ci
2025-03-25 10:04:03 +08:00
github-actions[bot]
291f8e1a4b release: v0.13.0 2025-03-24 12:32:05 +00:00
yuyutaotao
999e52ebc1
feat: update prompt for qwen (#491) 2025-03-24 19:47:19 +08:00
Zhou Xiao
36d47e4aef
feat(playwright): add aiTap/aiInput and other AI actions for Playwright integration (#489)
* feat(playwright): add aiTap/aiInput and other AI actions for Playwright integration

* chore: update doc

---------

Co-authored-by: yutao <yutao.tao@bytedance.com>
2025-03-24 19:30:53 +08:00