UncleCode 9547bada3a feat(content): add target_elements parameter for selective content extraction
Adds new target_elements parameter to CrawlerRunConfig that allows more flexible content selection than css_selector. This enables focusing markdown generation and data extraction on specific elements while still processing the entire page for links and media.

Key changes:
- Added target_elements list parameter to CrawlerRunConfig
- Modified WebScrapingStrategy and LXMLWebScrapingStrategy to handle target_elements
- Updated documentation with examples and comparison between css_selector and target_elements
- Fixed table extraction in content_scraping_strategy.py

BREAKING CHANGE: Table extraction logic has been modified to better handle thead/tbody structures
2025-03-10 18:54:51 +08:00
..
2024-06-08 17:59:42 +08:00
2025-01-13 19:19:58 +08:00
2025-01-13 19:19:58 +08:00
2025-01-13 19:19:58 +08:00
2025-02-28 19:53:35 +08:00
2025-01-13 19:19:58 +08:00
2025-01-13 19:19:58 +08:00
2025-02-28 19:53:35 +08:00
2025-01-13 19:19:58 +08:00
2025-01-13 19:19:58 +08:00