86 Commits

Author SHA1 Message Date
github-actions[bot]
68ef17f384 release: v0.21.0 2025-07-04 08:44:49 +00:00
github-actions[bot]
a4c541c71b release: v0.20.1 2025-06-25 04:55:17 +00:00
github-actions[bot]
277656aa6a release: v0.20.0 2025-06-19 03:45:05 +00:00
github-actions[bot]
bcfc2fcc29 release: v0.19.1 2025-06-17 05:23:12 +00:00
github-actions[bot]
95e24f98ee release: v0.19.0 2025-06-13 03:32:14 +00:00
Leyang
32f503f69e
fix(shared): reduce null element node (#827)
* fix(shared): reduce null element node

* feat(web-integration): delete content and locator, support visible-only for domIncluded

* feat(web-integration): remove logScreenshot documentation and update tests to use treeToList for content extraction

* feat(web-integration): add trimContextByViewport function to filter execution tasks by visible elements

* release: v0.18.1

* fix(workflow): update default release version to 'prepatch'

* fix(web-integration): node detect empty

* release: v0.18.2

* fix(evaluation): restore generateExtractData

* fix(web-integration): reporter export

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-06-13 11:25:31 +08:00
Leyang
3510a8c736
feat(web-integration): support unstableLogContent api for agent (#826)
* feat(web-integration): support unstableLogContent api for agent

* fix(web-integration): use sync method

* docs(core): update API doc

* fix(web-integration): playwright

* docs(site): update

---------

Co-authored-by: yutao <yutao.tao@bytedance.com>
2025-06-11 16:50:11 +08:00
c9f9a48c70
feat(recorder): add record ability to chrome extension (#769)
* chore: add record example

* chore: delete useless file

* chore: optimize common logic

* chore: optimize record logic

* chore: support horizontal scroll

* chore: optimize label input logic

* chore: delete useless file

* chore: modify record log content

* chore: add record lib

* chore: optimize record iife logic

* chore: delete useless config

* chore: modify comment

* feat(chrome-extension): add recording functionality

- Add new record UI components (record.tsx, record.less)
- Integrate recording capabilities into popup extension
- Update worker script to handle recording events
- Add content script wrapper for recording injection
- Remove EventRecorder from record-form app (moved to extension)
- Update manifest.json permissions for recording
- Update dependencies for recording functionality
- Enhance store with recording state management

* feat(chrome-extension): improve recording stability with page refresh handling

- Add tab update monitoring to automatically stop recording on page refresh/navigation
- Implement content script injection check with ping mechanism
- Enhance error handling with specific messages for Chrome internal pages
- Improve recording stop reliability when content script becomes unavailable
- Add graceful handling for missing content script during stop operation

* fix(core): align dependency versions across workspace

- Update React to ^19.1.0 and React DOM to >=19.1.0 in all packages
- Update @types/react to ^19.1.4 and @types/react-dom to ^19.1.3
- Update TypeScript to ^5.8.3 across all packages
- Update dayjs to use caret version ^1.11.11
- Align all rsbuild and rslib plugin versions
- Fix dependency version inconsistencies to pass CI checks

* refactor(chrome-extension): rename content-script-wrapper to event-recorder-bridge

- Renamed content-script-wrapper.js to event-recorder-bridge.js
- Updated references in rsbuild.config.ts and record.tsx
- Updated internal comments to match new name
- New name better describes the file's bridge functionality

* feat(chrome-extension): enhance recording session management and UI

- Introduced RecordingSession interface for better session handling
- Implemented session persistence with localStorage for recording sessions
- Added UI components for creating, editing, and deleting recording sessions
- Enhanced event handling to load and display events from selected sessions
- Improved user feedback with success messages for session actions
- Updated styles for session management and recording status indicators

* feat(chrome-extension): enhance recording UI and session management

- Simplified the Record component structure by separating list and detail views for better user experience.
- Improved styling for the recording sessions list and detail views, ensuring better responsiveness and usability.
- Added functionality to switch between list and detail views for recording sessions.
- Updated session management to allow for viewing, editing, and deleting sessions with improved user feedback.
- Enhanced event handling and display for recorded events within the detail view.

* feat(record): add screenshot capture for recorded events

- Add screenshot capture before and after each recorded event
- Implement screenshot capture in chrome extension service worker
- Add screenshot fields to RecordedEvent interface
- Create UI components for displaying screenshot thumbnails in timeline
- Add hover effects and popover for screenshot preview
- Remove unused AGENTS.md file

* feat(chrome-extension): align React and TypeScript versions across the project

- Downgrade React and React DOM to 18.3.1 in all packages for consistency
- Update @types/react and @types/react-dom to 18.3.1
- Ensure TypeScript version is consistently set to ^5.8.3
- Adjust dependencies in package.json files for various apps and packages
- Enhance Chrome extension functionality with safe API wrappers for better error handling
- Implement checks for Chrome extension environment in recording components

* feat(chrome-extension): enhance event recording with new features and dependencies

- Add @rsbuild/plugin-node-polyfill to support Node.js polyfills in the project
- Update event recording logic to include viewport dimensions and element positioning
- Improve screenshot handling by integrating new properties in RecordedEvent interface
- Enhance RecordTimeline component to display element positioning and dimensions
- Optimize event handling and UI updates for better user experience
- Clean up and refactor code for improved readability and maintainability

* feat(record): enhance event data with coordinates in event recorder

- Add x and y coordinates to the event data sent to the Chrome extension
- Update RecordTimeline component to utilize new coordinate properties for element positioning
- Ensure compatibility with existing event handling logic

* chore: optimize select box logic

* chore: delete useless config

* feat(record): optimize AI description generation to be asynchronous and non-blocking

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

* feat(record): implement smooth scroll with enhanced animations and performance

- Add requestAnimationFrame-based smooth scrolling for better performance
- Implement container-level auto-scroll to bottom when new events arrive
- Extract event optimization logic to separate eventOptimizer utility module
- Add smooth scroll CSS properties and touch optimization for mobile
- Enhance hover animations with cubic-bezier easing and GPU acceleration
- Add slideInUp and fadeInScale keyframe animations for timeline items
- Improve scrollbar styling with transition effects
- Optimize will-change properties for better rendering performance
- Add overscroll-behavior to prevent scroll propagation

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

* chore(workflow): add CLAUDE.md with project documentation and development guide

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

* feat(record): optimize session creation to auto-start recording with timestamp name

* feat(record): optimize session creation to use timestamp naming by default

- Modified session creation to auto-create with timestamp name without showing modal
- Improved record UI to display session titles more effectively
- Reorganized description display in both list and detail views

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

* feat(record): improve session management and code quality

- Optimize session creation to use timestamp naming by default without requiring modal input
- Improve UI layout to display titles and descriptions more effectively in both list and detail views
- Apply code formatting improvements across the codebase for better readability
- Fix linting issues in scripts and component files

* feat(record): implement caching for element descriptions and screenshots to enhance performance

* feat(record): refactor stopRecording logic and add visibility change handling for extension popup

* feat(record): auto-stop recording when leaving detail view

* feat(record): improve recording state management and handle asynchronous stop actions

* chore: upgrade lock file

* feat(record): add persistence for recording state in local storage

* feat(record): add modular structure with components and hooks for improved organization

* feat(record): implement AI-generated titles and descriptions for recording sessions

* feat(record): enhance recording session management with improved layout and AI title generation

* feat(record): enhance recording control hook with session update callback

* fix: update coding standards and improve code formatting across multiple files

* feat(record): add Playwright test generation feature and export controls

* feat(record): enhance AI description generation with caching and callback handling

* feat(record): replace RecordedEvent with ChromeRecordedEvent for improved event handling

* feat(record): improve debug logging and event handling in recording sessions

* feat(record): enhance event optimization and logging, add diagnosis feature for recording chain

* feat(record): update RecordDetail and RecordList components, improve diagnostic function and type imports

* feat(record): enhance tab monitoring with event listeners for activation and updates

* feat(record): sort sessions by updated time in RecordList component

* feat(record): implement session ID handling and cleanup for previous recordings

* feat(eventOptimizer): improve element rectangle handling and callback updates in description generation

* feat(record): enhance AI title and description generation logic, improve event handling and logging

* feat(record): implement debounce mechanism for AI description generation and enhance event rectangle validation

* feat(record): enhance AI description generation with improved caching and debouncing logic, and add session ID dependency in cleanup

* feat(record): update AI title and description generation logic to prevent overwriting existing session names and descriptions, and improve event handling in scroll events

* refactor(record): clean up code formatting and improve logging consistency across components

* docs(record): update CLAUDE.md with detailed implementation and optimization strategies for the recording system

* feat(record): enhance event details display with JSON copy functionality and improved cache key generation

* chore(commitlint): increase maximum header length to 300 characters

* chore(commitlint): disable scope-enum rule enforcement

* feat(record): implement unified logging system and replace console logs with structured logger

* feat(record): add hash ID generation for recorded events and clean up elementRect handling

* feat(record): add before and after screenshot functionality to event timeline

* feat(record): remove event-recorder-bridge script and update build configuration

* feat(record): update screenshot handling in RecordTimeline component to display before and after images

* refactor(record): clean up code formatting and remove unnecessary lines

* feat(build): enhance build configuration with caching and input/output settings

* feat(record): pause active recording sessions when creating a new session

* feat(record): enhance Playwright export controls with live event handling and improved description checks

* refactor(record): streamline configuration and improve logging clarity

* feat(record): update YAML generation to follow Midscene.js conventions with enhanced action mapping and descriptive naming

* feat(record): simplify YAML generation for Midscene.js with clearer structure and updated action mapping

* feat(record): enhance YAML export functionality with improved event handling and loading state

* feat(record): add Playwright and YAML test generation utilities with shared types and export controls

* feat(record): replace PlaywrightExportControls with a unified ExportControls component

* feat(record): integrate Playwright test generation from core package and update language handling in UI components

* feat(record): add @midscene/record dependency and integrate YAML test generation from core package

* feat(record): enhance event handling by saving current recording events during session updates and cleanup

* feat(record): enhance event recording by adding final navigation event and improving error handling in event recorder bridge

* feat(record): improve event handling by conditionally clearing events during recording start

* feat(record): optimize event handling by implementing debouncing and refining fallback descriptions

* feat(record): enhance test generation by extracting navigation and viewport info for improved options

* feat(record): implement event persistence with local storage for recording sessions

* chore: clean up code formatting and improve readability across multiple files

* feat(record): add dropdown menu for code generation and streamline state management

* feat(record): remove YAML export functionality from ExportControls and related components

* feat(record): add ProgressModal component for enhanced recording feedback

* feat(record): enhance ExportControls and ProgressModal with improved UI and confetti animation

* feat(record): simplify ExportControls and RecordDetail components by removing unused functionality and improving session details display

* feat(record): improve error handling and logging in content script checks and enhance tab URL validation

* feat(record): add generatedCode field to RecordingSession and improve error handling in content script checks

* feat(record): add canvas-confetti for celebratory effects in ProgressModal and update dependencies

* feat(record): refactor RecordDetail to use sessionId and improve session retrieval with error handling

* feat(progress-modal): adjust confetti display duration and update completion timing in ProgressModal

* chore: upgrade lock file

* refactor(snapshot): simplify element description rules for clarity and brevity

* feat(record): initialize optimizedEvents state as empty array and refactor logging methods for improved clarity

* refactor(export-controls): streamline event description generation and update progress step order for clarity

* feat(progress-modal): add ShinyText component for enhanced step title display during loading

* refactor(export-controls): consolidate test generation logic into a common function for improved maintainability

* refactor(export-controls, eventOptimizer): improve code readability and consistency by formatting and simplifying logic

* chore: delete useless config

* chore: add ignore file

* chore: add ignore file

* refactor: migrate from 'record' to 'recorder' package and consolidate related files

* refactor: remove playwright example file and integrate example code into prompts and YAML generation

* chore: hide recorder

* chore: delete useless file

* chore: delete useless file

* refactor: migrate from 'record-form' to 'recorder-form' and update related configurations

* refactor: remove shiny-text component and update imports in related files

* refactor: enhance ProgressModal styling for better alignment and readability

* refactor: update finalEvents assignment in ExportControls for session title generation

* refactor: comment out recorder section in PlaygroundPopup for future review

* chore(ci): fix lint error

---------

Co-authored-by: Claude <noreply@anthropic.com>
2025-06-11 16:40:21 +08:00
github-actions[bot]
67590d3273 release: v0.18.0 2025-06-09 03:04:12 +00:00
github-actions[bot]
e8db6dc56a release: v0.17.5 2025-06-03 08:21:59 +00:00
github-actions[bot]
14b9200236 release: v0.17.4 2025-05-30 06:31:25 +00:00
Leyang
ca8edcd328
ai query: support querying from the HTML DOM (#782)
* feat(web-integration): query from all dom element

* test(web-integration): add tests

* fix(cli): tests

* fix(shared): tests

* fix(core): tests

* feat(evaluation): update data

* refactor(web-integration): update var

* Revert "feat(evaluation): update data"

This reverts commit d882bddd38f3ce3a24d14a345c71aaf3bd725aab.

* test(web-integration): fix tests
2025-05-30 11:23:00 +08:00
yuyutaotao
3c04995e26
fix(core): update error log of dump file (#757)
* fix(core): update error log of dump file

* chore(workflow): update ci config

* fix(core): update evaluation

* chore(core): add test case

* chore(core): update ci config

* chore(core): update ci config

* fix(core): ci

* fix(core): update inspect test case

* fix(core): e2e

* fix(puppeteer): show model info in report

* fix(core): update evaluation config

* chore(core): fix lint
2025-05-26 23:50:34 +08:00
github-actions[bot]
0dd38c2537 release: v0.17.3 2025-05-26 06:36:57 +00:00
github-actions[bot]
3b4671b0be release: v0.17.2 2025-05-23 06:18:49 +00:00
yuyutaotao
1589d10f35
docs(core): update blog (#735)
* docs(core): update blog

* docs(blog): update workflow blog

* docs(site): add details and pic

* docs(site): update pic

* docs(site): update pic 2

---------

Co-authored-by: quanruzhuoxiu <quanruzhuoxiu@gmail.com>
2025-05-21 16:49:26 +08:00
github-actions[bot]
049d9f6565 release: v0.17.1 2025-05-20 09:11:17 +00:00
bac9051d2d
feat(evaluation): add screenspot v2 evaluation (#737)
* feat(evaluation): add screenspot v2 evaluation

* style(evaluation): format files array in package.json
2025-05-20 15:52:03 +08:00
github-actions[bot]
39ddc57df3 release: v0.17.0 2025-05-16 14:32:01 +00:00
yuyutaotao
b261ed7f2a
feat(web): use xpath and yaml as cache (#711)
* feat(web-integration): use xpath for cache instead of id

* feat(web-integration): enhance TaskCache to support xpaths for cache matching and add new test cases

* feat(web-integration): add debug log for unknown page types in TaskCache

* feat(web-integration): update caching logic and cache hit conditions for Plan and Locate tasks

* chore(core): update debug log

* feat(web-integration): update rspress.config and enhance TaskCache structure with new properties

* feat(web-integration): recalculate id when hit cache

* fix(web-integration): update mock implementation in task-cache test to use evaluate method

* feat(web-integration): enhance element caching by adding XPath support and improving cache hit logic

* chore(core): lint

* feat(web-integration): improve XPath handling in web-extractor

* test(web-integration): fix tests

* feat(core, web-integration): add attributes to LocateResultElement and enhance element handling

* fix(core): lint

* feat(web-integration): add midsceneVersion to TaskCache and update cache validation logic

* fix(core): test

* fix(web-integration): update cache validation logic to prevent reading outdated midscene cache files

* feat(web-integration): enhance TaskCache to track used cache items and improve cache retrieval logic

* fix(core): xpath logic (#710)

* feat(core): reuse context for locate

* feat(core): build yamlFlow from aiAction

* feat(core): refine task-cache

* feat(core): update cache

* feat(core): refine task-cache

* feat(core): refine task-cache

* feat(core): remove unused checkElementExistsByXPath

* feat(core): use yaml file as cache

* chore(core): fix lint

* chore(core): print warning for previous cache

* refactor(core): remove quickAnswer references and improve element matching logic

* fix(core): update import path for buildYamlFlowFromPlans

* chore(web-integration): update output image and skip task error test

* fix(web-integration): update test snapshots to handle beta versions

* fix(web-integration): adjust test snapshots for version consistency

* fix(web-integration): track original cache length and adjust matching logic in tests

* fix(web-integration): update test URLs to reflect new target site and enable previously skipped test

* chore(core): update cache docs

* fix(core): test

* feat(core): try to match element from plan

* fix(web-integration): keep cache id stable when retrying in Playwright

* fix(web-integration): typo

* style(web-integration): lint

* fix(web-integration): stable cacheid in tests

* fix(web-integration): cache id

---------

Co-authored-by: quanruzhuoxiu <quanruzhuoxiu@gmail.com>
2025-05-16 17:16:56 +08:00
github-actions[bot]
571ec07829 release: v0.16.10 2025-05-09 09:00:22 +00:00
github-actions[bot]
7f982f7cb7 release: v0.16.9 2025-05-09 03:44:51 +00:00
github-actions[bot]
c5a88931f0 release: v0.16.8 2025-04-30 14:31:14 +00:00
github-actions[bot]
7ce8225369 release: v0.16.7 2025-04-29 08:36:53 +00:00
github-actions[bot]
aa2e767b79 release: v0.16.6 2025-04-29 05:54:48 +00:00
github-actions[bot]
9c5d22a57e release: v0.16.5 2025-04-29 05:17:39 +00:00
github-actions[bot]
061ff92c69 release: v0.16.4 2025-04-29 04:25:49 +00:00
github-actions[bot]
4f41e99ddc release: v0.16.3 2025-04-28 15:14:58 +00:00
github-actions[bot]
c4112adb51 release: v0.16.2 2025-04-28 06:22:02 +00:00
github-actions[bot]
33d2733c77 release: v0.16.1 2025-04-28 02:58:31 +00:00
github-actions[bot]
8acbb6a32f release: v0.16.0 2025-04-28 00:26:25 +00:00
github-actions[bot]
ee3821f7f9 release: v0.15.5 2025-04-26 07:42:06 +00:00
github-actions[bot]
767b3e3a8c release: v0.15.4 2025-04-25 09:18:44 +00:00
Leyang
ca644d8914
feat(core): allow custom midscene_run dir (#631)
* feat(core): support custom midscene_run dir

* feat(report): add search functionality to PlaywrightCaseSelector component

* refactor(shared): simplify base directory resolution and remove unused environment variable

* feat(shared): integrate shared environment variables across multiple packages

* refactor(shared): update base directory resolution to use dynamic midscene_run directory

* fix(puppeteer): increase screenshot timeout from 3s to 10s for improved reliability
2025-04-24 22:54:52 +08:00
github-actions[bot]
bd104a8afa release: v0.15.3 2025-04-24 08:04:27 +00:00
github-actions[bot]
ec3fd040d4 release: v0.15.2 2025-04-23 01:51:12 +00:00
github-actions[bot]
385f3eb639 release: v0.15.1 2025-04-21 12:13:32 +00:00
github-actions[bot]
0a3bc1e230 release: v0.15.0 2025-04-18 11:26:30 +00:00
yuyutaotao
dd233ef25b
chore: adapting new model (#570)
* chore: adapting new model

* feat: adapt bbox format

* chore: adapt ui-tars

---------

Co-authored-by: zhouxiao.shaw <zhouxiao.shaw@bytedance.com>
2025-04-17 10:53:19 +08:00
yuyutaotao
edf48da198
docs: update caching docs (#559)
* docs: update caching docs

* docs: update caching docs

* docs: update docs

* chore: update docs
2025-04-14 14:50:48 +08:00
github-actions[bot]
1bd873b360 release: v0.14.3 2025-04-10 09:10:39 +00:00
yuyutaotao
66a5d16d37
chore: update evaluation fixture (#552) 2025-04-10 16:54:21 +08:00
github-actions[bot]
4d77207c0f release: v0.14.2 2025-04-09 02:30:47 +00:00
yuyutaotao
732f605144
fix(llm): coords offset in vl locator (#545) 2025-04-08 17:56:15 +08:00
github-actions[bot]
92e4d7ed61 release: v0.14.1 2025-04-03 08:40:18 +00:00
github-actions[bot]
00dc2f9f50 release: v0.14.0 2025-04-03 06:01:06 +00:00
yuyutaotao
0171b5e2de
fix: ci testing (#529)
* fix: ci

* fix: ci
2025-04-03 10:51:50 +08:00
yuyutaotao
bcdf90b997
feat: add context for aiAction (#528) 2025-04-02 20:34:23 +08:00
yuyutaotao
0c0675ba4b
feat: update deepThink interface (#517)
* feat: update deepThink interface

* fix: CI

* chore: update error message

* feat: update search area config

* fix: deepthink ui

---------

Co-authored-by: Zhou Xiao <zhouxiao.shaw@bytedance.com>
2025-04-02 19:26:56 +08:00
Leyang
bf9b4e06e7
feat: migrate android code to @midscene/android (#505) 2025-03-26 18:59:50 +08:00