Qingyun Wu
2cdc08a75a
update notebook and test
2022-03-30 19:11:10 -07:00
Chi Wang
9128c8811a
handle failing trials ( #505 )
...
* handle failing trials
* clarify when to return {}
* skip ensemble in accuracy check
2022-03-28 16:57:52 -07:00
Qingyun Wu
6c16e47e42
Bug fix and add documentation for metric_constraints ( #498 )
...
* metric constraint documentation
* update link
* update notebook
* fix a bug in adding 'time_total_s' to result
* use the default multiple factor from config file
* update notebook
* format
* improve test
* revise test budget for macos
* bug fix in adding time_total_s
* increase performance check budget
* revise test
* update notebook
* uncomment test
* remove redundancy
* clear output
* remove n_jobs
* remove constraint in notebook
* increase budget
* revise test
* add python version
* use getattr
* improve code robustness
Co-authored-by: Qingyun Wu <qxw5138@psu.edu >
2022-03-26 21:11:45 -04:00
Xueqing Liu
72301b8568
fixing a few bugs in nlp ( #503 )
...
* fixing bugs in nlp
2022-03-26 14:08:51 -04:00
Chi Wang
7eb7b46ea9
version number and doc ( #497 )
...
* version number
* add missing tasks in documentation
* update node-forge version
2022-03-25 17:32:37 -07:00
Xueqing Liu
5f97532986
adding evaluation ( #495 )
...
* adding automl.score
* fixing the metric name in train_with_config
* adding pickle after score
* fixing a bug in automl.pickle
2022-03-25 17:00:08 -04:00
Xueqing Liu
af423463c3
fixing bug for ner ( #463 )
...
* fixing bug for ner
* removing global var
* adding class for trial counter
* adding notebook
* adding use_ray dict
* updating documentation for nlp
2022-03-20 22:03:02 -04:00
Qingyun Wu
f6ae1331f5
metric constraints in flaml.automl ( #479 )
...
* metric constraints
* revise docstr
* fix docstr
* improve docstr
* Update flaml/automl.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* Update flaml/automl.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* Update flaml/automl.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* docstr
Co-authored-by: Qingyun Wu <qxw5138@psu.edu >
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
2022-03-12 00:39:35 -05:00
Kevin Chen
f9eda0cc40
update documentation for time series forecasting ( #472 )
...
* update automl.py
- documentation update
* update test_forecast.py
* update model.py
* update automl_time_series_forecast.ipynb
* update time series forecast website examples
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
2022-03-08 11:21:18 -08:00
Chi Wang
31ac984c4b
don't init global search with points_to_evaluate unless evaluated_rewards is provided; handle callbacks in fit kwargs ( #469 )
2022-03-01 18:39:16 -08:00
Chi Wang
df01031cfe
Zero-shot AutoML ( #468 )
...
* Prepare for release
Co-authored-by: Moe Kayali <t-moekayali@microsoft.com >
* bug fix
* improve doc and code quality
Co-authored-by: Qingyun Wu
2022-03-01 15:39:09 -08:00
Chi Wang
9e88f22167
fix a bug when using ray & update ray on aml ( #455 )
...
* fix a bug when using ray & update ray on aml
When using with_parameters(), the config argument must be the first argument in the trainable function.
* make training function runnable standalone
2022-02-11 20:14:10 -08:00
Chi Wang
b4d312412a
bump ray version to 1.10 ( #450 )
...
* bump ray version to 1.10
* init ray in test
* Update setup.py to include hotfixes
Co-authored-by: Antoni Baum <antoni.baum@protonmail.com >
2022-02-09 15:04:29 -08:00
Chi Wang
8a44dd4318
data in csv ( #430 )
...
* data in csv
* support ray ObjectRef #365
* use object store to store data when using ray
* make lgbm tuning example a test
* homepage title
2022-01-30 19:36:41 -08:00
Chi Wang
6960a833ec
Gpu support for xgboost ( #442 )
...
* xgboost gpu support
* test xgboost gpu
* test sparse data
* add xgboost test
* remove ray.init to avoid pytest error
2022-01-30 13:02:18 -08:00
Kevin Chen
c75f97b475
Change the upper bound for "lags" hyperparameter for sklearn forecast models ( #437 )
...
* update model.py
- change upper bound for "lags" hyperparameter
* update test_forecast.py
- add a test for a large dataset
* update sample.py
- pre-commit changes
2022-01-30 07:30:30 -08:00
Xueqing Liu
438ccaa0c9
adding catch for HTTP error ( #432 )
2022-01-29 22:53:32 -08:00
Kevin Chen
81f54026c9
Support time series forecasting for discrete target variable ( #416 )
...
* support 'ts_forecast_classification' task to forecast discrete values
* update test_forecast.py
- add test for forecasting discrete values
* update test_model.py
* pre-commit changes
2022-01-24 18:39:36 -08:00
Xueqing Liu
4814091d87
remove redundant imports ( #426 )
...
* remove redundant imports
* getting ride of hf dataset
2022-01-24 14:24:14 -08:00
Chi Wang
6a7caa6a3d
max_iter < 2 -> no search; sign in metric constraints; test and example for forecasting ( #415 )
...
* max_iter < 2 -> no search
* use_ray in test
* eval_method in ts example
* check sign of constraints
* test metric constraint sign
2022-01-23 01:24:15 -08:00
Chi Wang
38ad31ea25
remove FLAML sample size from config ( #418 )
2022-01-22 22:59:44 -08:00
Xueqing Liu
dda4ac90a1
moving intermediate_results logging from model.py to huggingface/trainer.py ( #403 )
...
* replacing val_loss with automl_metric
2022-01-14 17:26:10 -08:00
Chi Wang
569908fbe6
fix issues in logging, bug in space.py, constraint sign, and improve code coverage ( #388 )
...
* console log handler
* version update
* doc
* skippable steps
* notebook update
* constraint sign
* doc for constraints
* bug fix: define-by-run and unflatten_hierarchical
* const
* handle nested space in indexof()
* test grid search
* test suggestion
* model test
* >1 ckpts
* always increase iter count
* log total # iterations
* security patch
* make iter_per_learner consistent
2022-01-14 13:39:09 -08:00
Xueqing Liu
f41f1c2198
Logging multiple checkpoints ( #394 )
2022-01-12 19:50:39 -08:00
Xueqing Liu
bd66e40296
fixing load best model at the end ( #389 )
2022-01-11 10:47:53 -08:00
Kevin Chen
d4273669e6
Time series forecasting with sklearn regressors ( #362 )
...
* add sklearn regressors as learners for ts_forecast task
* add direct forecasting strategy
warnings and errors for duplicate rows and missing values
- add preprocess for sklearn time series forecast
update automl.py
update test/test_forecast.py
* update model.py and test_forecast.py for cv eval_method
* add "hcrystalball" dependency in setup.py
* update automl.py
- add _validate_ts_data function for abstraction
- include xgb_limitdepth as a learner
* update model.py
- update search space for sklearn ts regressors
* update automl.py and test_forecast.py for numpy array inputs
* add documentations to model.py
* add documentation for removing catboost regressor
* update automl.py
- _validate_ts_data() function
Signed-off-by: Kevin Chen <chenkevin.8787@gmail.com >
2022-01-06 23:12:38 -08:00
Chi Wang
612668e8ed
serialize TransformerEstimator ( #381 )
...
* serialize TransformerEstimator
* check has_attr
* custom metric needs trainer
* skip test on mac
2022-01-06 10:28:19 -08:00
Xueqing Liu
207b6935d9
adding token classification ( #376 )
...
* adding ner
2022-01-03 13:44:10 -05:00
Chi Wang
8602def1c4
logging ( #371 )
...
* query logged runs
* mlflow log when using ray
* key check for newer version of ray #363
* catch importerror
* log and load AutoML model
* retrain if necessary when ensemble fails
2022-01-02 21:37:19 -08:00
oberonbot
9c00e4272a
Finish the Multiple Choice Classification ( #367 )
...
* adding multiple choice
* update test cases (hard coded)
* merged common code in predict_proba and predict in TransformersEstimator
2022-01-02 20:12:34 -05:00
Chi Wang
2f5d6169d3
example update ( #359 )
...
update some examples for consistencies with others.
2021-12-25 16:13:39 -08:00
Xueqing Liu
b2900f4b22
fixing custom metric ( #357 )
...
* fixing the error for custom metric
2021-12-24 16:23:09 -05:00
Rui Zhuang
c6c0c29769
Simplify lgbm example ( #358 )
...
* simplify lgbm examples
* provide link to lgbm example script.
* simply lgbm example in the example script.
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
2021-12-23 23:05:14 -08:00
Xueqing Liu
dcfd218108
Fixing the bug in custom metric ( #356 )
...
* fixing the bug for custom metric
2021-12-23 18:44:53 -05:00
Chi Wang
300f286667
azureml + ray ( #344 )
...
* examples and documentation about how to use azureml + ray
2021-12-23 13:37:07 -08:00
Chi Wang
0b25e89f29
reproducibility for random sampling ( #349 )
...
* reproducibility for random sampling #236
* doc update
2021-12-22 12:12:25 -08:00
Xueqing Liu
ee3162e232
Adding the NLP task summarization ( #346 )
...
* Add test_autohf_summarization.py
* adding seq2seq
* Update flaml/nlp/huggingface/trainer.py
* rouge metrics
Co-authored-by: XinZofStevens <xzhao4346@gmail.com >
Co-authored-by: JinzhuoWu <wujinzhuo0105@gmail.com >
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
2021-12-20 14:19:32 -08:00
Chi Wang
efd85b4c86
Deploy a new doc website ( #338 )
...
A new documentation website. And:
* add actions for doc
* update docstr
* installation instructions for doc dev
* unify README and Getting Started
* rename notebook
* doc about best_model_for_estimator #340
* docstr for keep_search_state #340
* DNN
Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu >
Co-authored-by: Z.sk <shaokunzhang@psu.edu >
2021-12-16 17:11:33 -08:00
Chia-Chi Hsu
671ccbbe3f
support for customized splitters ( #333 )
...
* add support for customized splitters
* use the param split_type for feeding generators
* use single API for customized splitter and add test
* when task==TS_FORCAST, always set shuffle=False
* update docstr
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
2021-12-16 16:13:04 -08:00
Z.sk
7b24662dca
Makes the evaluation_function could receive the incumbent best result as input in Tune ( #339 )
...
* update tune function
* pass incumbent result to the training function
* Update test/tune/test_record_incumbent.py
* Update flaml/searcher/search_thread.py
* Update flaml/searcher/blendsearch.py
* Update flaml/tune/tune.py
* add constant variable
Co-authored-by: 张少坤 <zhangshaokun@fuzhi.ai >
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
2021-12-15 21:12:47 -08:00
Qingyun Wu
17b17d084f
tune api for schedulers ( #322 )
...
* revise api and tests
* rename prune_attr
* update finetune notebook
* add scheduler test and notebook
* update tune api for scheduler
* remove scheduler notebook
* Update flaml/tune/tune.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* docstr
* fix imports
* clear notebook output
* fix ray import
* Update flaml/tune/tune.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* improve docstr
* Update flaml/searcher/blendsearch.py
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
* remove redundant import
Co-authored-by: Qingyun Wu <qxw5138@psu.edu >
Co-authored-by: Chi Wang <wang.chi@microsoft.com >
2021-12-04 21:52:20 -05:00
Chi Wang
7d269435ae
add save_best_config()
2021-12-04 16:29:52 -08:00
Chi Wang
18230ed22f
pred_time_limit clarification and logging ( #319 )
...
* pred_time_limit clarification
* log prediction time
* handle ChunkedEncodingError in test
2021-12-03 16:02:00 -08:00
Xueqing Liu
fb59bb9928
adding TODOs for NLP module, so students can implement other tasks easier ( #321 )
...
* fixing ray pickle bug, skipping macosx bug, completing code for seqregression
* catching connectionerror
* ading TODOs for NLP module
2021-12-03 12:45:16 -05:00
Chi Wang
c57954fbbd
include default value in rf search space ( #317 )
...
* include default value in rf search space
* init _mem_per_iter with -1
* bump version to 0.8.2
* docstr for search space's arguments
2021-12-03 09:15:21 -08:00
Michal Chromcak
b0ef3b7995
Add conda forge minimal test ( #309 )
...
* add conda forge minimal test, create pytest markers
2021-11-25 13:58:20 -08:00
晓宇
5dc948da18
Update test_regression.py ( #306 )
...
* Update test_regression.py
There is a another way for mutioutput-model-trian.
RegressorChain is more adapting to the targets which are relavant.
2021-11-25 08:18:22 -08:00
Xueqing Liu
fd136b02d1
bug fix for TransformerEstimator ( #293 )
...
* fix checkpoint naming + trial id for non-ray mode, fix the bug in running test mode, delete all the checkpoints in non-ray mode
* finished testing for checkpoint naming, delete checkpoint, ray, max iter = 1
* adding predict_proba, address PR 293's comments
close #293 #291
2021-11-23 11:26:39 -08:00
Chi Wang
85e21864ce
test -> val; docstr ( #300 )
...
* rename test -> val in custom metric function
* add an example in docstr
resolve #299
2021-11-22 22:17:29 -08:00
Chi Wang
d937b03e42
multioutput regression ( #292 )
...
* make AutoML inherit sklearn.base.BaseEstimator such that it can be wrapped in sklearn.multioutput.MultiOutputRegressor for multi-output regression.
* moved and simplified preprocessing code in AutoML.predictI() to _preprocess()
2021-11-22 06:59:42 -08:00