360 Commits

Author SHA1 Message Date
Z.sk
d7ff258b89
Merge pull request #2 from microsoft/main
update
2022-01-14 23:07:06 +08:00
Xueqing Liu
c1b5cb5348
fixing default metric for regression + change verbosity for transformers (#397)
* fixing default metric for regression + change verbosity for transformers

* fixing per_device_train_batch_size

* Update flaml/automl.py for gpu_per_trial
2022-01-13 21:08:51 -08:00
Chi Wang
8e72904b9e
postcss version update (#385) 2022-01-12 22:24:17 -08:00
Xueqing Liu
f41f1c2198
Logging multiple checkpoints (#394) 2022-01-12 19:50:39 -08:00
liususan091219
303d40c76c set verbose for transformers 2022-01-11 21:42:27 -08:00
Kevin Chen
99667dad5f
Regression forecast debug (#391)
* update automl.py
- fix bug with removing "catboost"
2022-01-11 13:16:59 -08:00
Xueqing Liu
bd66e40296
fixing load best model at the end (#389) 2022-01-11 10:47:53 -08:00
Z.sk
b66930e4f0
Merge pull request #1 from microsoft/main
update
2022-01-11 08:32:50 +08:00
Xueqing Liu
c54c1246c6
fixing auto metric bug (#387) 2022-01-07 16:25:58 -08:00
Kevin Chen
d4273669e6
Time series forecasting with sklearn regressors (#362)
* add sklearn regressors as learners for ts_forecast task

* add direct forecasting strategy
warnings and errors for duplicate rows and missing values

- add preprocess for sklearn time series forecast
 update automl.py
 update test/test_forecast.py

* update model.py and test_forecast.py for cv eval_method

* add "hcrystalball" dependency in setup.py

* update automl.py
- add _validate_ts_data function for abstraction
- include xgb_limitdepth as a learner

* update model.py
- update search space for sklearn ts regressors

* update automl.py and test_forecast.py for numpy array inputs

* add documentation to model.py

* add documentation for removing catboost regressor

* update automl.py
- _validate_ts_data() function

Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com>
2022-01-06 23:12:38 -08:00
Chi Wang
612668e8ed
serialize TransformerEstimator (#381)
* serialize TransformerEstimator

* check has_attr

* custom metric needs trainer

* skip test on mac
2022-01-06 10:28:19 -08:00
Chi Wang
cd9740f022
Fix several issues for nlp tasks (#380)
* num cpu issue #378;
* temp fix for ray issue #379;
* transformers version.
2022-01-05 13:49:12 -08:00
Chi Wang
8593a3fe27
citation file (#364) 2022-01-04 15:13:14 -08:00
Xueqing Liu
207b6935d9
adding token classification (#376)
* adding ner
2022-01-03 13:44:10 -05:00
Chi Wang
8602def1c4
logging (#371)
* query logged runs

* mlflow log when using ray

* key check for newer version of ray #363

* catch importerror

* log and load AutoML model

* retrain if necessary when ensemble fails
2022-01-02 21:37:19 -08:00
oberonbot
9c00e4272a
Finish the Multiple Choice Classification (#367)
* adding multiple choice

* update test cases (hard coded)

* merged common code in predict_proba and predict in TransformersEstimator
2022-01-02 20:12:34 -05:00
Chi Wang
2f5d6169d3
example update (#359)
update some examples for consistency with others.
2021-12-25 16:13:39 -08:00
Xueqing Liu
b2900f4b22
fixing custom metric (#357)
* fixing the error for custom metric
2021-12-24 16:23:09 -05:00
Rui Zhuang
c6c0c29769
Simplify lgbm example (#358)
* simplify lgbm examples

* provide link to lgbm example script.

* simplify lgbm example in the example script.

Co-authored-by: Chi Wang <wang.chi@microsoft.com>
2021-12-23 23:05:14 -08:00
Xueqing Liu
dcfd218108
Fixing the bug in custom metric (#356)
* fixing the bug for custom metric
2021-12-23 18:44:53 -05:00
Chi Wang
300f286667
azureml + ray (#344)
* examples and documentation about how to use azureml + ray
2021-12-23 13:37:07 -08:00
Chi Wang
baa0359324
doc update (#352)
* custom splitter
* NLP
* version number
2021-12-22 14:35:13 -08:00
Chi Wang
0b25e89f29
reproducibility for random sampling (#349)
* reproducibility for random sampling #236

* doc update
2021-12-22 12:12:25 -08:00
Xueqing Liu
ee3162e232
Adding the NLP task summarization (#346)
* Add test_autohf_summarization.py

* adding seq2seq

* Update flaml/nlp/huggingface/trainer.py

* rouge metrics

Co-authored-by: XinZofStevens <xzhao4346@gmail.com>
Co-authored-by: JinzhuoWu <wujinzhuo0105@gmail.com>
Co-authored-by: Chi Wang <wang.chi@microsoft.com>
2021-12-20 14:19:32 -08:00
Chi Wang
a340dbc554 url update for doc edit 2021-12-18 15:00:54 -08:00
Chi Wang
f3fc35c438 version update 2021-12-16 18:38:19 -08:00
Chi Wang
efd85b4c86
Deploy a new doc website (#338)
A new documentation website. And:

* add actions for doc

* update docstr

* installation instructions for doc dev

* unify README and Getting Started

* rename notebook

* doc about best_model_for_estimator #340

* docstr for keep_search_state #340

* DNN

Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu>
Co-authored-by: Z.sk <shaokunzhang@psu.edu>
2021-12-16 17:11:33 -08:00
Chia-Chi Hsu
671ccbbe3f
support for customized splitters (#333)
* add support for customized splitters

* use the param split_type for feeding generators

* use single API for customized splitter and add test

* when task==TS_FORECAST, always set shuffle=False

* update docstr

Co-authored-by: Chi Wang <wang.chi@microsoft.com>
2021-12-16 16:13:04 -08:00
Z.sk
7b24662dca
Allow the evaluation_function to receive the incumbent best result as input in Tune (#339)
* update tune function

* pass incumbent result to the training function

* Update test/tune/test_record_incumbent.py

* Update flaml/searcher/search_thread.py

* Update flaml/searcher/blendsearch.py

* Update flaml/tune/tune.py

* add constant variable

Co-authored-by: 张少坤 <zhangshaokun@fuzhi.ai>
Co-authored-by: Chi Wang <wang.chi@microsoft.com>
2021-12-15 21:12:47 -08:00
Chi Wang
228b00cbee
use cfo in tune.run if bs is not installed (#334)
Use cfo in tune.run if bs is not installed and issue a warning.
2021-12-12 21:10:06 -08:00
Chi Wang
434586e2e2
train at least one iter when not trained (#336)
* train at least one iter when not trained

* bump version to 0.9.1
2021-12-12 20:05:18 -08:00
Xueqing Liu
1a3e01c352
adding HF metrics (#335)
* adding nlp metrics

* fix ndcg
2021-12-10 12:32:49 -05:00
Chi Wang
b773e2898f Update flaml_pytorch_cifar10.ipynb
training_function -> train_cifar
2021-12-07 15:17:15 -08:00
Qingyun Wu
dd60dbc5eb
rename training_function (#327)
* rename training_function

* add docstr

* update docstr

* update docstr and comments

Co-authored-by: Qingyun Wu <qxw5138@psu.edu>
2021-12-06 17:03:43 -05:00
Chi Wang
3111084c07 add __init__.py in nlp 2021-12-06 09:15:39 -08:00
Qingyun Wu
17b17d084f
tune api for schedulers (#322)
* revise api and tests

* rename prune_attr

* update finetune notebook

* add scheduler test and notebook

* update tune api for scheduler

* remove scheduler notebook

* Update flaml/tune/tune.py

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* docstr

* fix imports

* clear notebook output

* fix ray import

* Update flaml/tune/tune.py

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* improve docstr

* Update flaml/searcher/blendsearch.py

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* remove redundant import

Co-authored-by: Qingyun Wu <qxw5138@psu.edu>
Co-authored-by: Chi Wang <wang.chi@microsoft.com>
2021-12-04 21:52:20 -05:00
Chi Wang
7d269435ae add save_best_config() 2021-12-04 16:29:52 -08:00
Chi Wang
54d303a95a
bug fix in config2params (#323)
* bug fix in config2params

* set the task property before config2params
2021-12-03 19:37:49 -08:00
Chi Wang
18230ed22f
pred_time_limit clarification and logging (#319)
* pred_time_limit clarification

* log prediction time

* handle ChunkedEncodingError in test
2021-12-03 16:02:00 -08:00
Xueqing Liu
fb59bb9928
adding TODOs for NLP module, so students can implement other tasks more easily (#321)
* fixing ray pickle bug, skipping macosx bug, completing code for seqregression

* catching connectionerror

* adding TODOs for NLP module
2021-12-03 12:45:16 -05:00
Chi Wang
c57954fbbd
include default value in rf search space (#317)
* include default value in rf search space

* init _mem_per_iter with -1

* bump version to 0.8.2

* docstr for search space's arguments
2021-12-03 09:15:21 -08:00
Chi Wang
1545d5a6d2
skip cv preparation if eval_method is holdout (#314)
* skip cv preparation if eval_method is holdout

* bump version to 0.8.1
2021-11-28 11:18:55 -08:00
Chi Wang
2f25a87d98
Code quality improvement based on #275 (#313)
* simplify & restructure

Co-authored-by: Albern S <62778698+albernsrya@users.noreply.github.com>
2021-11-28 10:14:25 -08:00
liususan091219
63f402b29e fixing config2params for transformersestimator 2021-11-26 21:28:38 -08:00
Michal Chromcak
b0ef3b7995
Add conda forge minimal test (#309)
* add conda forge minimal test, create pytest markers
2021-11-25 13:58:20 -08:00
晓宇
5dc948da18
Update test_regression.py (#306)
* Update test_regression.py

There is another way to train a multi-output model.
RegressorChain is better suited to targets that are related to one another.
2021-11-25 08:18:22 -08:00
Xueqing Liu
fd136b02d1
bug fix for TransformerEstimator (#293)
* fix checkpoint naming + trial id for non-ray mode, fix the bug in running test mode, delete all the checkpoints in non-ray mode

* finished testing for checkpoint naming, delete checkpoint, ray, max iter = 1

* adding predict_proba, address PR 293's comments

close #293 #291
2021-11-23 11:26:39 -08:00
Chi Wang
85e21864ce
test -> val; docstr (#300)
* rename test -> val in custom metric function
* add an example in docstr
resolve #299
2021-11-22 22:17:29 -08:00
Chi Wang
ea6d28d7bd
add max_depth to xgboost search space (#282)
* add max_depth to xgboost search space

* notebook update

* two learners for xgboost (max_depth or max_leaves)
2021-11-22 21:17:48 -08:00
Chi Wang
d937b03e42
multioutput regression (#292)
* make AutoML inherit sklearn.base.BaseEstimator such that it can be wrapped in sklearn.multioutput.MultiOutputRegressor for multi-output regression.

* moved and simplified preprocessing code in AutoML.predict() to _preprocess()
2021-11-22 06:59:42 -08:00