Chi Wang
dffa802b3e
use_best_model for catboost ( #679 )
...
* use_best_model for catboost
* bump version to 1.0.11
2022-08-20 18:38:56 -07:00
Chi Wang
5e1059ab82
check config constraints for the initial config ( #685 )
...
* check config constraints for the initial config
* default config value
2022-08-15 05:30:23 -07:00
jmrichardson
e43485607a
Disable shuffle for custom CV ( #659 )
...
* Disable shuffle for custom CV
* Add custom fold shuffle test
* Update test_split.py
* Update test_split.py
2022-08-12 17:05:32 -07:00
Kevin Chen
f718d18b5e
time series forecasting with panel datasets ( #541 )
...
* time series forecasting with panel datasets
- integrate Temporal Fusion Transformer as a learner based on pytorchforecasting
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update setup.py
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update test_forecast.py
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update setup.py
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update setup.py
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update model.py and test_forecast.py
- remove blank lines
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update model.py to prevent errors
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update automl.py and data.py
- change forecast task name
- update documentation for fit() method
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update test_forecast.py
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update test_forecast.py
- add performance test
- use 'fit_kwargs_by_estimator'
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* add time index function
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update test_forecast.py performance test
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update data.py
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update automl.py
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update data.py to prevent type error
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update setup.py
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update for pytorch forecasting tft on panel datasets
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update automl.py documentations
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* - rename estimator
- add 'gpu_per_trial' for tft estimator
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update test_forecast.py
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* include ts panel forecasting as an example
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update model.py
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update documentations
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update automl_time_series_forecast.ipynb
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update documentations
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* "weights_summary" argument deprecated and removed for pl.Trainer()
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update model.py tft estimator prediction method
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update model.py
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update `fit_kwargs` documentation
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
* update automl.py
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
2022-08-12 08:39:22 -07:00
jmrichardson
25ad397d55
Skip transform ( #665 )
...
* Skip transform
* Fix logic and docstring, add test
* Add period ending to skip_transform doc
* Add skip_transform to retrain_from_log method
* Update test/automl/test_classification.py
Co-authored-by: Xueqing Liu <liususan091219@users.noreply.github.com >
Co-authored-by: Xueqing Liu <liususan091219@users.noreply.github.com >
2022-08-11 19:41:23 -04:00
Chi Wang
816a82a115
make test result more stable ( #646 )
2022-08-05 10:17:41 -07:00
Xueqing Liu
5eb5d43d7f
Fix HPO evaluation bug ( #645 )
...
* fix eval automl metric bug on val_loss inconsistency
* updating starting point search space to continuous
* shortening notebok
2022-07-28 23:08:42 -04:00
Chi Wang
e14e909af9
Feature names and importances ( #621 )
...
* feature names and importances
* None check
* StackingClassifier has no feature_importances_
* StackingClassifier has no feature_names_in_
2022-07-10 12:25:59 -07:00
Qingyun Wu
b7846048dc
Allow FLAML_sample_size in starting_points ( #619 )
...
* FLAML_sample_size
* clean up
* starting_points as a list
* catch AssertionError
* per estimator sample size
* import
* per estimator min_sample_size
* Update flaml/automl.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* Update test/automl/test_warmstart.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* add warnings
* adding more tests
* fix a bug in validating starting points
* improve test
* revise test
* revise test
* documentation about custom_hp
* doc and efficiency
* update test
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
2022-07-09 16:04:46 -04:00
Chi Wang
74cca60606
Allow custom GroupKFold object as split_type ( #616 )
...
* Allow custom GroupKFold object
* handle unpickle error for prophet 1.1
* eval_method in test_object()
2022-06-29 21:04:25 -07:00
Chi Wang
7d6822aa40
cath URLError
2022-06-24 08:07:26 -07:00
Chi Wang
4377d53a73
update got version ( #607 )
...
* update got version
* None check
2022-06-23 08:02:46 -07:00
Chi Wang
c45741a67b
support latest xgboost version ( #599 )
...
* support latest xgboost version
* Update test_classification.py
* Update
Exists problems when installing xgb1.6.1 in py3.6
* cleanup
* xgboost version
* remove time_budget_s in test
* remove redundancy
* stop support of python 3.6
Co-authored-by: zsk <shaokunzhang529@gmail.com >
Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu >
2022-06-21 18:59:07 -07:00
Chi Wang
f8cc38bc16
enable ensemble when using ray ( #583 )
...
* enable ensemble when using ray
* sanitize config
2022-06-10 21:28:47 -07:00
Chi Wang
0642b6e7bb
init value type match ( #575 )
...
* init value type match
* bump version to 1.0.6
* add a note about flaml version in notebook
* add note about mismatched ITER_HP
* catch SSLError when accessing OpenML data
* catch errors in autovw test
Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu >
2022-06-09 08:11:15 -07:00
Xueqing Liu
e0e317bfb1
fixing trainable and update function, completing NOTE ( #566 )
...
* fix checkpoint naming + trial id for non-ray mode, fix the bug in running test mode, delete all the checkpoints in non-ray mode
* finished testing for checkpoint naming, delete checkpoint, ray, max iter = 1
2022-06-03 15:19:22 -04:00
Chi Wang
1af682b7f5
update doc about scheduler exception ( #564 )
...
* update doc about scheduler exception
* remove assert
2022-05-31 17:21:57 -07:00
Chi Wang
49e8f7f028
use zeroshot when no budget is given; custom_hp ( #563 )
...
* use zeroshot when no budget is given; custom_hp
* update Getting-Started
* protobuf version
* X_val
2022-05-28 17:22:09 -07:00
Qiaochu Song
2851134052
Quick-fix ( #539 )
...
* fix doc string; enable label transform in automl.score
2022-05-19 11:43:34 -04:00
Chi Wang
7126b69ce0
choose n_jobs for ensemble according to n_jobs per learner ( #551 )
...
* set n_jobs in ensemble dict
* catch the ensemble error
* choose n_jobs for stacker
* clarify
2022-05-18 21:01:51 -07:00
Xueqing Liu
2a8decdc50
fix the post-processing bug in NER ( #534 )
...
* fix conll bug
* update DataCollatorForAuto
* adding label_list comments
2022-05-10 17:22:57 -04:00
Chi Wang
c1bb66980c
test reproducibility from retrain ( #533 )
2022-05-07 09:13:17 -07:00
Chi Wang
e877de6414
use ffill in forecasting example
2022-04-01 09:23:23 -07:00
Chi Wang
84f1ae7424
Bump minimist from 1.2.5 to 1.2.6 in /website ( #502 )
...
* Bump minimist from 1.2.5 to 1.2.6 in /website
* check best_config in test
2022-03-30 22:19:47 -07:00
Qingyun Wu
2cdc08a75a
update notebook and test
2022-03-30 19:11:10 -07:00
Chi Wang
9128c8811a
handle failing trials ( #505 )
...
* handle failing trials
* clarify when to return {}
* skip ensemble in accuracy check
2022-03-28 16:57:52 -07:00
Qingyun Wu
6c16e47e42
Bug fix and add documentation for metric_constraints ( #498 )
...
* metric constraint documentation
* update link
* update notebook
* fix a bug in adding 'time_total_s' to result
* use the default multiple factor from config file
* update notebook
* format
* improve test
* revise test budget for macos
* bug fix in adding time_total_s
* increase performance check budget
* revise test
* update notebook
* uncomment test
* remove redundancy
* clear output
* remove n_jobs
* remove constraint in notebook
* increase budget
* revise test
* add python version
* use getattr
* improve code robustness
Co-authored-by: Qingyun Wu <qxw5138@psu.edu >
2022-03-26 21:11:45 -04:00
Xueqing Liu
5f97532986
adding evaluation ( #495 )
...
* adding automl.score
* fixing the metric name in train_with_config
* adding pickle after score
* fixing a bug in automl.pickle
2022-03-25 17:00:08 -04:00
Xueqing Liu
af423463c3
fixing bug for ner ( #463 )
...
* fixing bug for ner
* removing global var
* adding class for trial counter
* adding notebook
* adding use_ray dict
* updating documentation for nlp
2022-03-20 22:03:02 -04:00
Qingyun Wu
f6ae1331f5
metric constraints in flaml.automl ( #479 )
...
* metric constraints
* revise docstr
* fix docstr
* improve docstr
* Update flaml/automl.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* Update flaml/automl.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* Update flaml/automl.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* docstr
Co-authored-by: Qingyun Wu <qxw5138@psu.edu >
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
2022-03-12 00:39:35 -05:00
Kevin Chen
f9eda0cc40
update documentation for time series forecasting ( #472 )
...
* update automl.py
- documentation update
* update test_forecast.py
* update model.py
* update automl_time_series_forecast.ipynb
* update time series forecast website examples
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
2022-03-08 11:21:18 -08:00
Chi Wang
31ac984c4b
don't init global search with points_to_evaluate unless evaluated_rewards is provided; handle callbacks in fit kwargs ( #469 )
2022-03-01 18:39:16 -08:00
Chi Wang
8a44dd4318
data in csv ( #430 )
...
* data in csv
* support ray ObjectRef #365
* use object store to store data when using ray
* make lgbm tuning example a test
* homepage title
2022-01-30 19:36:41 -08:00
Kevin Chen
c75f97b475
Change the upper bound for "lags" hyperparameter for sklearn forecast models ( #437 )
...
* update model.py
- change upper bound for "lags" hyperparameter
* update test_forecast.py
- add a test for a large dataset
* update sample.py
- pre-commit changes
2022-01-30 07:30:30 -08:00
Kevin Chen
81f54026c9
Support time series forecasting for discrete target variable ( #416 )
...
* support 'ts_forecast_classification' task to forecast discrete values
* update test_forecast.py
- add test for forecasting discrete values
* update test_model.py
* pre-commit changes
2022-01-24 18:39:36 -08:00
Chi Wang
6a7caa6a3d
max_iter < 2 -> no search; sign in metric constraints; test and example for forecasting ( #415 )
...
* max_iter < 2 -> no search
* use_ray in test
* eval_method in ts example
* check sign of constraints
* test metric constraint sign
2022-01-23 01:24:15 -08:00
Chi Wang
38ad31ea25
remove FLAML sample size from config ( #418 )
2022-01-22 22:59:44 -08:00
Kevin Chen
d4273669e6
Time series forecasting with sklearn regressors ( #362 )
...
* add sklearn regressors as learners for ts_forecast task
* add direct forecasting strategy
warnings and errors for duplicate rows and missing values
- add preprocess for sklearn time series forecast
update automl.py
update test/test_forecast.py
* update model.py and test_forecast.py for cv eval_method
* add "hcrystalball" dependency in setup.py
* update automl.py
- add _validate_ts_data function for abstraction
- include xgb_limitdepth as a learner
* update model.py
- update search space for sklearn ts regressors
* update automl.py and test_forecast.py for numpy array inputs
* add documentations to model.py
* add documentation for removing catboost regressor
* update automl.py
- _validate_ts_data() function
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
2022-01-06 23:12:38 -08:00
Chi Wang
8602def1c4
logging ( #371 )
...
* query logged runs
* mlflow log when using ray
* key check for newer version of ray #363
* catch importerror
* log and load AutoML model
* retrain if necessary when ensemble fails
2022-01-02 21:37:19 -08:00
Chi Wang
2f5d6169d3
example update ( #359 )
...
update some examples for consistencies with others.
2021-12-25 16:13:39 -08:00
Chi Wang
0b25e89f29
reproducibility for random sampling ( #349 )
...
* reproducibility for random sampling #236
* doc update
2021-12-22 12:12:25 -08:00
Chi Wang
efd85b4c86
Deploy a new doc website ( #338 )
...
A new documentation website. And:
* add actions for doc
* update docstr
* installation instructions for doc dev
* unify README and Getting Started
* rename notebook
* doc about best_model_for_estimator #340
* docstr for keep_search_state #340
* DNN
Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu >
Co-authored-by: Z.sk <shaokunzhang@psu.edu >
2021-12-16 17:11:33 -08:00
Chia-Chi Hsu
671ccbbe3f
support for customized splitters ( #333 )
...
* add support for customized splitters
* use the param split_type for feeding generators
* use single API for customized splitter and add test
* when task==TS_FORCAST, always set shuffle=False
* update docstr
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
2021-12-16 16:13:04 -08:00
Qingyun Wu
17b17d084f
tune api for schedulers ( #322 )
...
* revise api and tests
* rename prune_attr
* update finetune notebook
* add scheduler test and notebook
* update tune api for scheduler
* remove scheduler notebook
* Update flaml/tune/tune.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* docstr
* fix imports
* clear notebook output
* fix ray import
* Update flaml/tune/tune.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* improve docstr
* Update flaml/searcher/blendsearch.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* remove redundant import
Co-authored-by: Qingyun Wu <qxw5138@psu.edu >
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
2021-12-04 21:52:20 -05:00
Chi Wang
7d269435ae
add save_best_config()
2021-12-04 16:29:52 -08:00
Chi Wang
18230ed22f
pred_time_limit clarification and logging ( #319 )
...
* pred_time_limit clarification
* log prediction time
* handle ChunkedEncodingError in test
2021-12-03 16:02:00 -08:00
Chi Wang
c57954fbbd
include default value in rf search space ( #317 )
...
* include default value in rf search space
* init _mem_per_iter with -1
* bump version to 0.8.2
* docstr for search space's arguments
2021-12-03 09:15:21 -08:00
晓宇
5dc948da18
Update test_regression.py ( #306 )
...
* Update test_regression.py
There is a another way for mutioutput-model-trian.
RegressorChain is more adapting to the targets which are relavant.
2021-11-25 08:18:22 -08:00
Chi Wang
85e21864ce
test -> val; docstr ( #300 )
...
* rename test -> val in custom metric function
* add an example in docstr
resolve #299
2021-11-22 22:17:29 -08:00
Chi Wang
d937b03e42
multioutput regression ( #292 )
...
* make AutoML inherit sklearn.base.BaseEstimator such that it can be wrapped in sklearn.multioutput.MultiOutputRegressor for multi-output regression.
* moved and simplified preprocessing code in AutoML.predictI() to _preprocess()
2021-11-22 06:59:42 -08:00