autogen

mirror of https://github.com/microsoft/autogen.git synced 2025-10-28 08:19:09 +00:00

Author	SHA1	Message	Date
Chi Wang	cafb67123a	Merge branch 'main' into LexiFlow	2022-10-14 11:04:18 -07:00
Susan Xueqing Liu	2ebddd67ae	Remove NLP classification head (#756 ) * rm classification head in nlp * rm classification head in nlp * rm classification head in nlp * adding test cases for switch classification head * adding test cases for switch classification head * Update test/nlp/test_autohf_classificationhead.py Co-authored-by: Chi Wang <wang.chi@microsoft.com> * adding test cases for switch classification head * run each test separately * skip classification head test on windows * disabling wandb reporting * fix test nlp custom metric * fix test nlp custom metric * fix test nlp custom metric * fix test nlp custom metric * fix test nlp custom metric * fix test nlp custom metric * fix test nlp custom metric * fix test nlp custom metric * fix test nlp custom metric * fix test nlp custom metric * fix test nlp custom metric * Update website/docs/Examples/AutoML-NLP.md Co-authored-by: Chi Wang <wang.chi@microsoft.com> * Update website/docs/Examples/AutoML-NLP.md Co-authored-by: Chi Wang <wang.chi@microsoft.com> * fix test nlp custom metric Co-authored-by: Chi Wang <wang.chi@microsoft.com>	2022-10-12 17:04:42 -07:00
Anonymous-submission-repo	9bc32acafb	first	2022-10-09 11:39:29 -04:00
Xueqing Liu	c01e65bb48	updating the data collator for seq-regression to handle the dim mismatch problem (#751 )	2022-10-08 21:59:31 -07:00
Xueqing Liu	ceb3e300cd	Issue724 (#745 ) * fixing issue724 * fixing issue724	2022-10-04 10:51:12 -04:00
Xueqing Liu	21fa6c10ec	Fixing the issue that FLAML trial number is significantly smaller than Transformers.hyperparameter_search (#657 ) * fix 636 * adding low cost config * update padding; update tokenization output y type (series -> DF); update low cost init config * updating todf; updating metric_loss_score	2022-08-03 00:11:29 -04:00
Xueqing Liu	5eb5d43d7f	Fix HPO evaluation bug (#645 ) * fix eval automl metric bug on val_loss inconsistency * updating starting point search space to continuous * shortening notebook	2022-07-28 23:08:42 -04:00
Xueqing Liu	214566313c	disable max_len for ner (#629 ) * disable max_len for ner	2022-07-10 06:33:02 -04:00
Xueqing Liu	6cb6a2a19a	isinstance(x, int) -> isinstance(x, (int, np.integer)) (#627 ) * isinstance(x, int) -> isinstance(x, (int, np.integer))	2022-07-06 13:22:05 -04:00
Xueqing Liu	6108493e0b	fix ner bug; refactor post processing of TransformersEstimator prediction (#615 ) * fix ner bug; refactor post processing * fix too many values to unpack * supporting id/token label for NER	2022-07-05 13:38:21 -04:00
Xueqing Liu	79a24d06a9	fixing a bug in nlp/utils.py (#590 ) * fixing a bug for ner	2022-06-14 17:31:12 -04:00
Xueqing Liu	2a8decdc50	fix the post-processing bug in NER (#534 ) * fix conll bug * update DataCollatorForAuto * adding label_list comments	2022-05-10 17:22:57 -04:00
Xueqing Liu	ca35fa969f	refactoring TransformersEstimator to support default and custom_hp (#511 ) * refactoring TransformersEstimator to support default and custom_hp * handling starting_points not in search space * addressing starting point more than max_iter * fixing upper < lower bug	2022-04-28 14:06:29 -04:00
liususan091219	0bcf618fea	fixing bug	2022-03-28 11:53:58 -07:00
Xueqing Liu	72301b8568	fixing a few bugs in nlp (#503 ) * fixing bugs in nlp	2022-03-26 14:08:51 -04:00
Xueqing Liu	af423463c3	fixing bug for ner (#463 ) * fixing bug for ner * removing global var * adding class for trial counter * adding notebook * adding use_ray dict * updating documentation for nlp	2022-03-20 22:03:02 -04:00
Kevin Chen	81f54026c9	Support time series forecasting for discrete target variable (#416 ) * support 'ts_forecast_classification' task to forecast discrete values * update test_forecast.py - add test for forecasting discrete values * update test_model.py * pre-commit changes	2022-01-24 18:39:36 -08:00
Xueqing Liu	4814091d87	remove redundant imports (#426 ) * remove redundant imports * getting rid of hf dataset	2022-01-24 14:24:14 -08:00
Xueqing Liu	3ef758cd7b	reducing AutoConfig.from_pretrained (#411 ) * reducing AutoConfig.from_pretrained	2022-01-17 11:44:11 -08:00
Xueqing Liu	cb9c7b0d16	adding logging of training loss (#406 ) * reducing AutoTokenizer load to only once * fixing early stop bug	2022-01-16 09:07:31 -08:00
Xueqing Liu	31645187f3	Update flaml/nlp/README.md (#404 )	2022-01-14 17:55:38 -08:00
Xueqing Liu	dda4ac90a1	moving intermediate_results logging from model.py to huggingface/trainer.py (#403 ) * replacing val_loss with automl_metric	2022-01-14 17:26:10 -08:00
Xueqing Liu	c1b5cb5348	fixing default metric for regression + change verbosity for transformers (#397 ) * fixing default metric for regression + change verbosity for transformers * fixing per_device_train_batch_size * Update flaml/automl.py for gpu_per_trial	2022-01-13 21:08:51 -08:00
Xueqing Liu	bd66e40296	fixing load best model at the end (#389 )	2022-01-11 10:47:53 -08:00
Xueqing Liu	c54c1246c6	fixing auto metric bug (#387 )	2022-01-07 16:25:58 -08:00
Xueqing Liu	207b6935d9	adding token classification (#376 ) * adding ner	2022-01-03 13:44:10 -05:00
oberonbot	9c00e4272a	Finish the Multiple Choice Classification (#367 ) * adding multiple choice * update test cases (hard coded) * merged common code in predict_proba and predict in TransformersEstimator	2022-01-02 20:12:34 -05:00
Xueqing Liu	b2900f4b22	fixing custom metric (#357 ) * fixing the error for custom metric	2021-12-24 16:23:09 -05:00
Chi Wang	0b25e89f29	reproducibility for random sampling (#349 ) * reproducibility for random sampling #236 * doc update	2021-12-22 12:12:25 -08:00
Xueqing Liu	ee3162e232	Adding the NLP task summarization (#346 ) * Add test_autohf_summarization.py * adding seq2seq * Update flaml/nlp/huggingface/trainer.py * rouge metrics Co-authored-by: XinZofStevens <xzhao4346@gmail.com> Co-authored-by: JinzhuoWu <wujinzhuo0105@gmail.com> Co-authored-by: Chi Wang <wang.chi@microsoft.com>	2021-12-20 14:19:32 -08:00
Chi Wang	efd85b4c86	Deploy a new doc website (#338 ) A new documentation website. And: * add actions for doc * update docstr * installation instructions for doc dev * unify README and Getting Started * rename notebook * doc about best_model_for_estimator #340 * docstr for keep_search_state #340 * DNN Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu> Co-authored-by: Z.sk <shaokunzhang@psu.edu>	2021-12-16 17:11:33 -08:00
Xueqing Liu	1a3e01c352	adding HF metrics (#335 ) * adding nlp metrics * fix ndcg	2021-12-10 12:32:49 -05:00
Chi Wang	3111084c07	add __init__.py in nlp	2021-12-06 09:15:39 -08:00
Xueqing Liu	fb59bb9928	adding TODOs for NLP module, so students can implement other tasks easier (#321 ) * fixing ray pickle bug, skipping macosx bug, completing code for seqregression * catching connectionerror * adding TODOs for NLP module	2021-12-03 12:45:16 -05:00
Xueqing Liu	fd136b02d1	bug fix for TransformerEstimator (#293 ) * fix checkpoint naming + trial id for non-ray mode, fix the bug in running test mode, delete all the checkpoints in non-ray mode * finished testing for checkpoint naming, delete checkpoint, ray, max iter = 1 * adding predict_proba, address PR 293's comments close #293 #291	2021-11-23 11:26:39 -08:00
Chi Wang	72caa2172d	model_history, ITER_HP, settings in AutoML(), checkpoint bug fix (#283 ) if save_best_model_per_estimator is False and retrain_final is True, unfit the model after evaluation in HPO. retrain if using ray. update ITER_HP in config after a trial is finished. change prophet logging level. example and notebook update. allow settings to be passed to AutoML constructor. Are you planning to add multi-output-regression capability to FLAML #192 Is multi-tasking allowed? #277 can pass the automl setting to the constructor instead of requiring a derived class. remove model_history. checkpoint bug fix. * model_history meaning save_best_model_per_estimator * ITER_HP * example update * prophet logging level * comment update in forecast notebook * print format improvement * allow settings to be passed to AutoML constructor * checkpoint bug fix * time limit for autohf regression test * skip slow test on macos * cleanup before del	2021-11-18 09:39:45 -08:00
Xueqing Liu	42de3075e9	Make NLP tasks available from AutoML.fit() (#210 ) Sequence classification and regression: "seq-classification" and "seq-regression" Co-authored-by: Chi Wang <wang.chi@microsoft.com>	2021-11-16 11:06:20 -08:00
Chi Wang	92ebd1f7f9	when max_iter=1, skip search only if retrain_final (#280 ) * when max_iter=1, skip search only if retrain_final * remove nlp redesign in #210 * minor change in readme example	2021-11-09 21:51:23 -08:00
Chi Wang	f48ca2618f	warning -> info for low cost partial config (#231 ) * warning -> info for low cost partial config #195, #110 * when n_estimators < 0, use trained_estimator's * log debug info * test random seed * remove "objective"; avoid ZeroDivisionError * hp config to estimator params * check type of searcher * default n_jobs * try import * Update searchalgo_auto.py * CLASSIFICATION * auto_augment flag * min_sample_size * make catboost optional	2021-10-08 16:09:43 -07:00
Chi Wang	339eb80f44	variable name (#187 )	2021-09-04 20:28:37 -07:00
Chi Wang	e46573a01d	warmstart blendsearch (#186 ) * increase test coverage * use define by run only when needed * warmstart bs * classification -> binary, multi * warm start with evaluated rewards * data transformer; resource attr for gs * BlendSearchTuner bug fix and unittest * bug fix * docstr and import * task type	2021-09-04 01:42:21 -07:00
Qingyun Wu	a229a6112a	Support parallel and add random search (#167 ) * non hashable value out of signature * parallel trials * add random in _search_parallel * fix bug in retraining * check memory constraint before training * retrain_full * log custom metric * retraining budget check * sample size check before retrain * remove 'time2eval' from result * report 'total_search_time' in result * rename total_search_time to wall_clock_time * rename train_loss boolean to log_training_metric * set default train_loss to None * exclude oom result * log retrained model * no subsample * doc str * notebook * predicted value is NaN for sarimax * version Co-authored-by: Chi Wang <wang.chi@microsoft.com> Co-authored-by: Qingyun Wu <qxw5138@psu.edu>	2021-08-23 16:36:51 -07:00
Xueqing Liu	eeaf5b5963	space -> main (#148 ) * subspace in flow2 * search space and trainable from AutoML * experimental features: multivariate TPE, grouping, add_evaluated_points * test experimental features * readme * define by run * set time_budget_s for bs Co-authored-by: liususan091219 <Xqq630517> * version * acl * test define_by_run_func * size * constraints Co-authored-by: Chi Wang <wang.chi@microsoft.com>	2021-08-02 16:10:26 -07:00
Xueqing Liu	d40993d920	apidoc (#116 ) * fixing apidoc errors Co-authored-by: Chi Wang (MSR) <wang.chi@microsoft.com> Co-authored-by: liususan091219 <Xqq630517>	2021-06-19 19:09:49 -07:00
Xueqing Liu	6133db84e8	remove learning_rate and weight_decay (#113 ) * remove varying_arg1, varying_args	2021-06-19 09:27:51 -07:00
Xueqing Liu	cd4be9c0e5	add notebook (#109 ) * added support for transformers==3.4.0 * updating error message * adding arxiv	2021-06-17 21:42:26 -07:00
Xueqing Liu	a5a5a4bc20	fixed API doc and import (#108 ) * removed run_analysis.py, run_autohf.py, test_jupyter.py	2021-06-15 09:55:23 -07:00
Xueqing Liu	926589bdda	exception, coverage for autohf (#106 ) * increase coverage * fixing exception messages * fixing import	2021-06-14 14:11:40 -07:00
Xueqing Liu	a4049ad9b6	autohf (#43 ) automate huggingface transformer	2021-06-09 08:37:03 -07:00

49 Commits