autogen

mirror of https://github.com/microsoft/autogen.git synced 2025-07-25 09:51:00 +00:00

Author	SHA1	Message	Date
Xueqing Liu	2a8decdc50	fix the post-processing bug in NER (#534 ) * fix conll bug * update DataCollatorForAuto * adding label_list comments	2022-05-10 17:22:57 -04:00
Xueqing Liu	ca35fa969f	refactoring TransformersEstimator to support default and custom_hp (#511 ) * refactoring TransformersEstimator to support default and custom_hp * handling starting_points not in search space * addressing starting point more than max_iter * fixing upper < lower bug	2022-04-28 14:06:29 -04:00
Jaden Kropp	d03038bfcb	docstr cleanup #523 : removed lines 259 to 260 in a1c49ca (#524 )	2022-04-27 07:50:38 -07:00
Xueqing Liu	cfed657812	Handling fractional gpu_per_trial for NLP (#513 ) * handling fractional gpu_per_trial	2022-04-12 14:46:14 -04:00
Xueqing Liu	72301b8568	fixing a few bugs in nlp (#503 ) * fixing bugs in nlp	2022-03-26 14:08:51 -04:00
Xueqing Liu	5f97532986	adding evaluation (#495 ) * adding automl.score * fixing the metric name in train_with_config * adding pickle after score * fixing a bug in automl.pickle	2022-03-25 17:00:08 -04:00
Xueqing Liu	af423463c3	fixing bug for ner (#463 ) * fixing bug for ner * removing global var * adding class for trial counter * adding notebook * adding use_ray dict * updating documentation for nlp	2022-03-20 22:03:02 -04:00
Kevin Chen	f9eda0cc40	update documentation for time series forecasting (#472 ) * update automl.py - documentation update * update test_forecast.py * update model.py * update automl_time_series_forecast.ipynb * update time series forecast website examples Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com>	2022-03-08 11:21:18 -08:00
Chi Wang	31ac984c4b	don't init global search with points_to_evaluate unless evaluated_rewards is provided; handle callbacks in fit kwargs (#469 )	2022-03-01 18:39:16 -08:00
Chi Wang	df01031cfe	Zero-shot AutoML (#468 ) * Prepare for release Co-authored-by: Moe Kayali <t-moekayali@microsoft.com> * bug fix * improve doc and code quality Co-authored-by: Qingyun Wu	2022-03-01 15:39:09 -08:00
Chi Wang	6960a833ec	Gpu support for xgboost (#442 ) * xgboost gpu support * test xgboost gpu * test sparse data * add xgboost test * remove ray.init to avoid pytest error	2022-01-30 13:02:18 -08:00
Kevin Chen	c75f97b475	Change the upper bound for "lags" hyperparameter for sklearn forecast models (#437 ) * update model.py - change upper bound for "lags" hyperparameter * update test_forecast.py - add a test for a large dataset * update sample.py - pre-commit changes	2022-01-30 07:30:30 -08:00
Kevin Chen	81f54026c9	Support time series forecasting for discrete target variable (#416 ) * support 'ts_forecast_classification' task to forecast discrete values * update test_forecast.py - add test for forecasting discrete values * update test_model.py * pre-commit changes	2022-01-24 18:39:36 -08:00
Chi Wang	38ad31ea25	remove FLAML sample size from config (#418 )	2022-01-22 22:59:44 -08:00
Xueqing Liu	47d2295fb7	Set use_ray to True for logging to databricks (#414 ) * fixing use_ray bug	2022-01-18 18:37:35 -08:00
MichaelMarien	1c911da9f8	Sklearn api x (#405 ) * changed signature of automl.predict and automl.predict_proba to X * XGBoostEstimator * changed signature of Prophet predict to X * changed signature of ARIMA predict to X * changed signature of TS_SKLearn_Regressor predict to X	2022-01-16 14:37:56 -08:00
Xueqing Liu	cb9c7b0d16	adding logging of training loss (#406 ) * reducing AutoTokenizer load to only once * fixing early stop bug	2022-01-16 09:07:31 -08:00
Xueqing Liu	dda4ac90a1	moving intermediate_results logging from model.py to huggingface/trainer.py (#403 ) * replacing val_loss with automl_metric	2022-01-14 17:26:10 -08:00
Chi Wang	569908fbe6	fix issues in logging, bug in space.py, constraint sign, and improve code coverage (#388 ) * console log handler * version update * doc * skippable steps * notebook update * constraint sign * doc for constraints * bug fix: define-by-run and unflatten_hierarchical * const * handle nested space in indexof() * test grid search * test suggestion * model test * >1 ckpts * always increase iter count * log total # iterations * security patch * make iter_per_learner consistent	2022-01-14 13:39:09 -08:00
Xueqing Liu	c1b5cb5348	fixing default metric for regression + change verbosity for transformers (#397 ) * fixing default metric for regression + change verbosity for transformers * fixing per_device_train_batch_size * Update flaml/automl.py for gpu_per_trial	2022-01-13 21:08:51 -08:00
Xueqing Liu	f41f1c2198	Logging multiple checkpoints (#394 )	2022-01-12 19:50:39 -08:00
liususan091219	303d40c76c	set verbose for transformers	2022-01-11 21:42:27 -08:00
Xueqing Liu	bd66e40296	fixing load best model at the end (#389 )	2022-01-11 10:47:53 -08:00
Xueqing Liu	c54c1246c6	fixing auto metric bug (#387 )	2022-01-07 16:25:58 -08:00
Kevin Chen	d4273669e6	Time series forecasting with sklearn regressors (#362 ) * add sklearn regressors as learners for ts_forecast task * add direct forecasting strategy warnings and errors for duplicate rows and missing values - add preprocess for sklearn time series forecast update automl.py update test/test_forecast.py * update model.py and test_forecast.py for cv eval_method * add "hcrystalball" dependency in setup.py * update automl.py - add _validate_ts_data function for abstraction - include xgb_limitdepth as a learner * update model.py - update search space for sklearn ts regressors * update automl.py and test_forecast.py for numpy array inputs * add documentations to model.py * add documentation for removing catboost regressor * update automl.py - _validate_ts_data() function Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com>	2022-01-06 23:12:38 -08:00
Chi Wang	612668e8ed	serialize TransformerEstimator (#381 ) * serialize TransformerEstimator * check has_attr * custom metric needs trainer * skip test on mac	2022-01-06 10:28:19 -08:00
Chi Wang	cd9740f022	Fix several issues for nlp tasks (#380 ) * num cpu issue #378; * temp fix for ray issue #379; * transformers version.	2022-01-05 13:49:12 -08:00
Xueqing Liu	207b6935d9	adding token classification (#376 ) * adding ner	2022-01-03 13:44:10 -05:00
Chi Wang	8602def1c4	logging (#371 ) * query logged runs * mlflow log when using ray * key check for newer version of ray #363 * catch importerror * log and load AutoML model * retrain if necessary when ensemble fails	2022-01-02 21:37:19 -08:00
oberonbot	9c00e4272a	Finish the Multiple Choice Classification (#367 ) * adding multiple choice * update test cases (hard coded) * merged common code in predict_proba and predict in TransformersEstimator	2022-01-02 20:12:34 -05:00
Xueqing Liu	b2900f4b22	fixing custom metric (#357 ) * fixing the error for custom metric	2021-12-24 16:23:09 -05:00
Xueqing Liu	dcfd218108	Fixing the bug in custom metric (#356 ) * fixing the bug for custom metric	2021-12-23 18:44:53 -05:00
Xueqing Liu	ee3162e232	Adding the NLP task summarization (#346 ) * Add test_autohf_summarization.py * adding seq2seq * Update flaml/nlp/huggingface/trainer.py * rouge metrics Co-authored-by: XinZofStevens <xzhao4346@gmail.com> Co-authored-by: JinzhuoWu <wujinzhuo0105@gmail.com> Co-authored-by: Chi Wang <wang.chi@microsoft.com>	2021-12-20 14:19:32 -08:00
Chi Wang	efd85b4c86	Deploy a new doc website (#338 ) A new documentation website. And: * add actions for doc * update docstr * installation instructions for doc dev * unify README and Getting Started * rename notebook * doc about best_model_for_estimator #340 * docstr for keep_search_state #340 * DNN Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu> Co-authored-by: Z.sk <shaokunzhang@psu.edu>	2021-12-16 17:11:33 -08:00
Chi Wang	434586e2e2	train at least one iter when not trained (#336 ) * train at least one iter when not trained * bump version to 0.9.1	2021-12-12 20:05:18 -08:00
Xueqing Liu	1a3e01c352	adding HF metrics (#335 ) * adding nlp metrics * fix ndcg	2021-12-10 12:32:49 -05:00
Chi Wang	54d303a95a	bug fix in confg2params (#323 ) * bug fix in confg2params * set the task property before config2params	2021-12-03 19:37:49 -08:00
Xueqing Liu	fb59bb9928	adding TODOs for NLP module, so students can implement other tasks easier (#321 ) * fixing ray pickle bug, skipping macosx bug, completing code for seqregression * catching connectionerror * ading TODOs for NLP module	2021-12-03 12:45:16 -05:00
Chi Wang	c57954fbbd	include default value in rf search space (#317 ) * include default value in rf search space * init _mem_per_iter with -1 * bump version to 0.8.2 * docstr for search space's arguments	2021-12-03 09:15:21 -08:00
liususan091219	63f402b29e	fixing config2params for transformersestimator	2021-11-26 21:28:38 -08:00
Xueqing Liu	fd136b02d1	bug fix for TransformerEstimator (#293 ) * fix checkpoint naming + trial id for non-ray mode, fix the bug in running test mode, delete all the checkpoints in non-ray mode * finished testing for checkpoint naming, delete checkpoint, ray, max iter = 1 * adding predict_proba, address PR 293's comments close #293 #291	2021-11-23 11:26:39 -08:00
Chi Wang	ea6d28d7bd	add max_depth to xgboost search space (#282 ) * add max_depth to xgboost search space * notebook update * two learners for xgboost (max_depth or max_leaves)	2021-11-22 21:17:48 -08:00
Chi Wang	72caa2172d	model_history, ITER_HP, settings in AutoML(), checkpoint bug fix (#283 ) if save_best_model_per_estimator is False and retrain_final is True, unfit the model after evaluation in HPO. retrain if using ray. update ITER_HP in config after a trial is finished. change prophet logging level. example and notebook update. allow settings to be passed to AutoML constructor. Are you planning to add multi-output-regression capability to FLAML #192 Is multi-tasking allowed? #277 can pass the auotml setting to the constructor instead of requiring a derived class. remove model_history. checkpoint bug fix. * model_history meaning save_best_model_per_estimator * ITER_HP * example update * prophet logging level * comment update in forecast notebook * print format improvement * allow settings to be passed to AutoML constructor * checkpoint bug fix * time limit for autohf regression test * skip slow test on macos * cleanup before del	2021-11-18 09:39:45 -08:00
Xueqing Liu	42de3075e9	Make NLP tasks available from AutoML.fit() (#210 ) Sequence classification and regression: "seq-classification" and "seq-regression" Co-authored-by: Chi Wang <wang.chi@microsoft.com>	2021-11-16 11:06:20 -08:00
Chi Wang	0d9439212f	update docstr	2021-11-06 09:37:33 -07:00
Chi Wang	549a0dfb53	limit time and memory consumption (#264 ) * limit time and memory * separate tests * lrl1 can't be limited by limit_resource * free memory when possible * passthrough=False when ensemble fails; retrain when trained_estimator is None * use callback to for resource limit * handle lower version of xgb with no callback * free mem ratio * reduce verbosity * retrain_final when max_iter==1 * remove trained_estimator from result * model_history * wheel * retrain time as best_config_train_time * ci: libomp version for xgboost on macos * limit_resource not working in windows * test pickle load * mute forecaster * notebook update * check hard * preventive callback * add use_ray	2021-11-03 19:08:23 -07:00
Kevin Chen	519bfc2a18	Integrate multivariate time series forecasting (#254 ) * Integrate multivariate time series forecasting, now supports continuous and categorical variables - update data.py to transform time series data - update search space - update documentations to reflect changes - update test_forecast.py - rename 'forecast' task to 'ts_forecast' task * update automl.py and test_forecast.py * update forecast notebook * update README.md and setup.py * update ml.py and test_forecast.py - make "ds" and "y" constant variables * replace constants with constant variables * bump version to 0.7.0 * update setup.py - support 'forecast' and 'ts_forecast' * update automl.py and data.py - support 'forecast' and 'ts_forecast' tasks	2021-10-30 09:48:57 -07:00
Chi Wang	7d6e860102	n_estimators for catboost	2021-10-18 21:56:21 -07:00
Chi Wang	b2d8b097d7	check n_iter == 1	2021-10-18 21:56:21 -07:00
Chi Wang	b03a87e737	no search when max_iter < 2	2021-10-18 21:56:21 -07:00

77 Commits