84 Commits

Author SHA1 Message Date
Nicolas
ccbae4b155 Update auth.ts 2024-12-14 00:20:14 -03:00
Nicolas
8a1c404918 Nick: revert trailing comma 2024-12-11 19:51:08 -03:00
Nicolas
00335e2ba9 Nick: fixed prettier 2024-12-11 19:46:11 -03:00
Gergő Móricz
6776aee1c3 feat(auth): extend rate limiter logging to make it easier to debug 2024-12-09 19:29:32 +01:00
Nicolas
4d287bb77f Nick: moving acuc temp to read replica 2024-12-06 13:06:26 -03:00
Gergő Móricz
7bde034020 auth: log team id 2024-12-04 23:12:55 +01:00
Nicolas
f7207f91b4 Nick: temp e-s-1 2024-12-04 16:25:43 -03:00
Gergő Móricz
8d467c8ca7
WebScraper refactor into scrapeURL (#714)
* feat: use strictNullChecking

* feat: switch logger to Winston

* feat(scrapeURL): first batch

* fix(scrapeURL): error swallow

* fix(scrapeURL): add timeout to EngineResultsTracker

* fix(scrapeURL): report unexpected error to sentry

* chore: remove unused modules

* feat(transfomers/coerce): warn when a format's response is missing

* feat(scrapeURL): feature flag priorities, engine quality sorting, PDF and DOCX support

* (add note)

* feat(scrapeURL): wip readme

* feat(scrapeURL): LLM extract

* feat(scrapeURL): better warnings

* fix(scrapeURL/engines/fire-engine;playwright): fix screenshot

* feat(scrapeURL): add forceEngine internal option

* feat(scrapeURL/engines): scrapingbee

* feat(scrapeURL/transformars): uploadScreenshot

* feat(scrapeURL): more intense tests

* bunch of stuff

* get rid of WebScraper (mostly)

* adapt batch scrape

* add staging deploy workflow

* fix yaml

* fix logger issues

* fix v1 test schema

* feat(scrapeURL/fire-engine/chrome-cdp): remove wait inserts on actions

* scrapeURL: v0 backwards compat

* logger fixes

* feat(scrapeurl): v0 returnOnlyUrls support

* fix(scrapeURL/v0): URL leniency

* fix(batch-scrape): ts non-nullable

* fix(scrapeURL/fire-engine/chromecdp): fix wait action

* fix(logger): remove error debug key

* feat(requests.http): use dotenv expression

* fix(scrapeURL/extractMetadata): extract custom metadata

* fix crawl option conversion

* feat(scrapeURL): Add retry logic to robustFetch

* fix(scrapeURL): crawl stuff

* fix(scrapeURL): LLM extract

* fix(scrapeURL/v0): search fix

* fix(tests/v0): grant larger response size to v0 crawl status

* feat(scrapeURL): basic fetch engine

* feat(scrapeURL): playwright engine

* feat(scrapeURL): add url-specific parameters

* Update readme and examples

* added e2e tests for most parameters. Still a few actions, location and iframes to be done.

* fixed type

* Nick:

* Update scrape.ts

* Update index.ts

* added actions and base64 check

* Nick: skipTls feature flag?

* 403

* todo

* todo

* fixes

* yeet headers from url specific params

* add warning when final engine has feature deficit

* expose engine results tracker for ScrapeEvents implementation

* ingest scrape events

* fixed some tests

* comment

* Update index.test.ts

* fixed rawHtml

* Update index.test.ts

* update comments

* move geolocation to global f-e option, fix removeBase64Images

* Nick:

* trim url-specific params

* Update index.ts

---------

Co-authored-by: Eric Ciarla <ericciarla@yahoo.com>
Co-authored-by: rafaelmmiller <8574157+rafaelmmiller@users.noreply.github.com>
Co-authored-by: Nicolas <nicolascamara29@gmail.com>
2024-11-07 20:57:33 +01:00
Nicolas
9e22c9a428 Nick: etier1a 2024-11-04 18:14:38 -05:00
Nicolas
96c579f1cd Nick: etier2c 2024-10-29 21:01:43 -03:00
Nicolas
e0ba339c50 Update auth.ts 2024-10-29 16:06:12 -03:00
Nicolas
6948ca6fe1 Revert "Update auth.ts"
This reverts commit 7152ac8856c35e0e1dde39d112c27ffd679f3ec4.
2024-10-29 16:02:21 -03:00
Nicolas
7152ac8856 Update auth.ts 2024-10-29 15:58:20 -03:00
Nicolas
d965f2ce7d Nick: fixes 2024-10-24 23:13:30 -03:00
Nicolas
29b34270c8 Merge branch 'main' into nsc/pay-as-you-go-lw2 2024-10-24 22:31:04 -03:00
Nicolas
bbfdda8867 Nick: init 2024-10-22 19:47:23 -03:00
Gergő Móricz
6ed3104eb6 feat: clear ACUC cache endpoint based on team ID 2024-10-22 20:28:10 +02:00
Nicolas
c3a9630e33 Reapply "Merge pull request #773 from mendableai/nsc/retries-acuc-price-credits-fallback"
This reverts commit a6888ce17b98e4a47fbfc9796911446a6513959c.
2024-10-14 12:24:34 -03:00
rafaelsideguide
a6888ce17b Revert "Merge pull request #773 from mendableai/nsc/retries-acuc-price-credits-fallback"
This reverts commit ba9ad1ef7f02722ce88c1c29fa285befd3c3ec51, reversing
changes made to 666082a7ddfa52242b4dd27f2c5f37f1f98a3302.
2024-10-14 10:32:09 -03:00
Nicolas
78b6127d88 Nick: retries for acuc 2024-10-13 22:27:38 -03:00
Nicolas
0bff5b1a24 Update auth.ts 2024-10-11 15:29:25 -03:00
Nicolas
257a951132 Update auth.ts 2024-10-11 14:21:04 -03:00
Gergő Móricz
a8df85fd9b fix(acuc): remove sentry capture 2024-09-30 19:10:24 +02:00
Gergő Móricz
5e8ef4954e feat(auth): log cache key in acuc update error 2024-09-27 17:13:10 +02:00
Gergő Móricz
2cb493321a fix(ACUC): do not refresh cache every set 2024-09-26 22:15:15 +02:00
Gergő Móricz
9bdd344b36 fix(redlock): use redlock.using for stability 2024-09-26 22:15:15 +02:00
Gergő Móricz
250c3bb5c6 fix(auth): move redlock settings 2024-09-26 22:15:15 +02:00
Gergő Móricz
81245e68fa fix(auth/redlock): retry cached ACUC lock for 20 seconds 2024-09-26 22:15:15 +02:00
Gergő Móricz
0f89f5e7cb fix(billTeam): cache update race condition 2024-09-26 22:15:15 +02:00
Gergő Móricz
84bff8add8 fix(billTeam): update cached ACUC after billing 2024-09-26 22:15:15 +02:00
Gergő Móricz
2073063fb7 fix(db): fix caching and rpc error 2024-09-26 22:15:15 +02:00
Gergő Móricz
f8c70fe5dd feat(db): implement auth_credit_usage_chunk RPC 2024-09-26 22:15:15 +02:00
Nicolas
52ac132328
Update auth.ts 2024-08-30 11:10:48 -03:00
Gergő Móricz
d4001e4528 fix(api/websocket): fix auth and termination 2024-08-29 20:01:00 +02:00
rafaelsideguide
b7f62f93c4 Merge branch 'v1-webscraper' of https://github.com/mendableai/firecrawl into v1-webscraper 2024-08-28 14:18:07 -03:00
rafaelsideguide
1c35534e2a fix(v1): check-status for preview 2024-08-28 14:18:05 -03:00
Nicolas
af67b14c9d Nick: 2024-08-28 14:17:59 -03:00
Nicolas
08387c06f8 Update auth.ts 2024-08-28 14:09:12 -03:00
Nicolas
d872bf0c4c Merge branch 'main' into v1-webscraper 2024-08-28 12:42:23 -03:00
Nicolas
7dff5cdf49 Nick: fixed sdk types and map preview 2024-08-27 20:02:39 -03:00
Nicolas
1e08e6d317 Merge branch 'main' into nsc/job-priority 2024-08-27 15:01:58 -03:00
Nicolas
6ab6ef9004 Update auth.ts 2024-08-26 19:57:27 -03:00
Nicolas
7d93eab0f8 Nick: 2024-08-26 18:48:00 -03:00
Nicolas
5606fe5870 Nick: 2024-08-26 16:05:11 -03:00
Gergő Móricz
670d253a8c fix(auth): fix error reporting 2024-08-22 14:08:09 +02:00
Nicolas
8a778278a9 Merge branch 'main' into nsc/job-priority 2024-08-21 22:57:55 -03:00
Gergo Moricz
0cdf41587e feat(sentry): add error handles to try-catch blocks 2024-08-22 03:55:40 +02:00
Nicolas
6bdb1d045d Merge branch 'main' into nsc/job-priority 2024-08-21 21:52:05 -03:00
Nicolas
e78d2af1f0 Nick: 2024-08-21 21:51:54 -03:00
Nicolas
0ea0a5db46 Nick: wip 2024-08-21 20:54:39 -03:00