autogen/test/nlp/test_autohf_tokenclassification.py

import sys
import pytest
import requests
import os
import shutil
from utils import (
    get_toy_data_tokenclassification_idlabel,
    get_toy_data_tokenclassification_tokenlabel,
    get_automl_settings,
)


@pytest.mark.skipif(
    # Compare version tuples: the string test sys.version < "3.7" is also true on 3.10+
    sys.platform in ["darwin", "win32"] or sys.version_info < (3, 7),
    reason="do not run on mac os, windows or py<3.7",
)
def test_tokenclassification_idlabel():
    from flaml import AutoML

    X_train, y_train, X_val, y_val = get_toy_data_tokenclassification_idlabel()
    automl = AutoML()

    automl_settings = get_automl_settings()
    automl_settings["task"] = "token-classification"
    automl_settings[
        "metric"
    ] = "seqeval:overall_f1"  # evaluating based on the overall_f1 of seqeval
    automl_settings["fit_kwargs_by_estimator"]["transformer"]["label_list"] = [
        "O",
        "B-PER",
        "I-PER",
        "B-ORG",
        "I-ORG",
        "B-LOC",
        "I-LOC",
        "B-MISC",
        "I-MISC",
    ]

    try:
        automl.fit(
            X_train=X_train,
            y_train=y_train,
            X_val=X_val,
            y_val=y_val,
            **automl_settings
        )
    except requests.exceptions.HTTPError:
        return

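    # perf test: each logged validation_loss should equal the minimum
    # eval_automl_metric among that trial's intermediate results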
    import json

    with open("seqclass.log", "r") as fin:
        for line in fin:
            each_log = json.loads(line.strip("\n"))
            if "validation_loss" in each_log:
                val_loss = each_log["validation_loss"]
                min_inter_result = min(
                    each_dict.get("eval_automl_metric", sys.maxsize)
                    for each_dict in each_log["logged_metric"]["intermediate_results"]
                )

                if min_inter_result != sys.maxsize:
                    assert val_loss == min_inter_result

    if os.path.exists("test/data/output/"):
        try:
            shutil.rmtree("test/data/output/")
        except PermissionError:
            print("PermissionError when deleting test/data/output/")


@pytest.mark.skipif(
    # Compare version tuples: the string test sys.version < "3.7" is also true on 3.10+
    sys.platform in ["darwin", "win32"] or sys.version_info < (3, 7),
    reason="do not run on mac os, windows or py<3.7",
)
def test_tokenclassification_tokenlabel():
    from flaml import AutoML

    X_train, y_train, X_val, y_val = get_toy_data_tokenclassification_tokenlabel()
    automl = AutoML()

    automl_settings = get_automl_settings()
    automl_settings["task"] = "token-classification"
    automl_settings[
        "metric"
    ] = "seqeval:overall_f1"  # evaluating based on the overall_f1 of seqeval

    try:
        automl.fit(
            X_train=X_train,
            y_train=y_train,
            X_val=X_val,
            y_val=y_val,
            **automl_settings
        )
    except requests.exceptions.HTTPError:
        return

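    # perf test: each logged validation_loss should equal the minimum
    # eval_automl_metric among that trial's intermediate results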
    import json

    with open("seqclass.log", "r") as fin:
        for line in fin:
            each_log = json.loads(line.strip("\n"))
            if "validation_loss" in each_log:
                val_loss = each_log["validation_loss"]
                min_inter_result = min(
                    each_dict.get("eval_automl_metric", sys.maxsize)
                    for each_dict in each_log["logged_metric"]["intermediate_results"]
                )

                if min_inter_result != sys.maxsize:
                    assert val_loss == min_inter_result

    if os.path.exists("test/data/output/"):
        try:
            shutil.rmtree("test/data/output/")
        except PermissionError:
            print("PermissionError when deleting test/data/output/")


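if __name__ == "__main__":
    # Run both label formats when the module is executed directly
    test_tokenclassification_idlabel()
    test_tokenclassification_tokenlabel()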