autogen/test/nlp/test_autohf.py

import os
import shutil
import sys

import pytest
import requests
from utils import get_toy_data_seqclassification, get_automl_settings


@pytest.mark.skipif(sys.platform == "darwin", reason="do not run on mac os")
def test_hf_data():
    """Fit, score, pickle, retrain from the log, and predict on toy sequence-classification data."""
    from flaml import AutoML

    X_train, y_train, X_val, y_val, X_test = get_toy_data_seqclassification()

    automl = AutoML()

    automl_settings = get_automl_settings()
    automl_settings["preserve_checkpoint"] = False

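    try:
        automl.fit(
            X_train=X_train,
            y_train=y_train,
            X_val=X_val,
            y_val=y_val,
            **automl_settings
        )
        automl.score(X_val, y_val, metric="accuracy")
        automl.pickle("automl.pkl")
    except requests.exceptions.HTTPError:
        # A failed HTTP request (e.g. while fetching remote resources) is an
        # environment problem rather than a test failure, so stop here.
        return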

    import json

    # Each log record's validation_loss should equal the smallest
    # eval_automl_metric among its intermediate results.
    with open("seqclass.log", "r") as fin:
        for line in fin:
            each_log = json.loads(line.strip("\n"))
            if "validation_loss" in each_log:
                val_loss = each_log["validation_loss"]
                min_inter_result = min(
                    each_dict.get("eval_automl_metric", sys.maxsize)
                    for each_dict in each_log["logged_metric"]["intermediate_results"]
                )

                # sys.maxsize is the sentinel for "no intermediate result had the metric".
                if min_inter_result != sys.maxsize:
                    assert val_loss == min_inter_result

    automl = AutoML()

    # Drop settings that only apply to the search before retraining from the log.
    automl_settings.pop("max_iter", None)
    automl_settings.pop("use_ray", None)
    automl_settings.pop("estimator_list", None)

    automl.retrain_from_log(
        X_train=X_train,
        y_train=y_train,
        train_full=True,
        record_id=0,
        **automl_settings
    )
    # Exercise predict with the test set, a flat list of strings,
    # and a nested list of strings.
    automl.predict(X_test, per_device_eval_batch_size=2)
    automl.predict(["test test", "test test"])
    automl.predict(
        [
            ["test test", "test test"],
            ["test test", "test test"],
            ["test test", "test test"],
        ]
    )

    automl.predict_proba(X_test)
    print(automl.classes_)

    del automl

    if os.path.exists("test/data/output/"):
        try:
            shutil.rmtree("test/data/output/")
        except PermissionError:
            print("PermissionError when deleting test/data/output/")


if __name__ == "__main__":
    test_hf_data()