Commit Graph

3198 Commits

Author SHA1 Message Date
levscaut
5eece5c748 Enhance Integration with Spark (#1097)
* add doc for spark

* labelCol equals to label by default

* change title and reformat

* reference about default index type

* fix doc build

* Update website/docs/Examples/Integrate - Spark.md

* update doc

* Added more references

* remove exception case when `y_train.name` is None

* fix broken link

---------

Co-authored-by: Wendong Li <v-wendongli@microsoft.com>
Co-authored-by: Li Jiang <bnujli@gmail.com>
2023-07-10 04:44:01 +00:00
Chi Wang
9c09090bf9 update notebook with new models (#1112)
* rerun notebook with new models

* planning agent

* link to notebook
2023-07-09 05:25:43 +00:00
Yiran Wu
ca10b286cc Support function_call in autogen/agent (#1091)
* update funccall

* code format

* update to comments

* update notebook

* remove test for py3.7

* allow funccall to class functions

* add test and clean up notebook

* revise notebook and test

* update

* update mathagent

* Update flaml/autogen/agent/agent.py

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* Update flaml/autogen/agent/user_proxy_agent.py

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* revise to comments

* revise function call design, notebook and test. add doc

* code format

* ad message_to_dict function

* update mathproxyagent

* revise docstr

* update

* Update flaml/autogen/agent/math_user_proxy_agent.py

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* Update flaml/autogen/agent/math_user_proxy_agent.py

Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu>

* Update flaml/autogen/agent/user_proxy_agent.py

Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu>

* simply funccall in userproxyagent, rewind auto-gen.md, revise to comments

* code format

* update

* remove notebook for another pr

* revise oai_conversation part  in agent, revise function exec in user_proxy_agent

* update test_funccall

* update

* update

* fix pydantic version

* Update test/autogen/test_agent.py

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* fix bug

* fix bug

* update

* update is_termination_msg to accept dict

---------

Co-authored-by: Chi Wang <wang.chi@microsoft.com>
Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu>
Co-authored-by: Li Jiang <bnujli@gmail.com>
2023-07-05 22:08:44 +00:00
Shaokun
dd9202bb01 Update OptunaSearch (#1106)
* update optuna

* update setup

* fix dependencies

* fix bugs in test

* fix bugs, web format

---------

Co-authored-by: “skzhang1” <“shaokunzhang529@gmail.com”>
Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu>
2023-07-05 03:14:02 +00:00
Chi Wang
4f1dfe6676 doc update (#1089)
* doc update

* add link to mathchat notebook

* use_docker property

* function name

* version update

---------

Co-authored-by: kevin666aa <yrwu000627@gmail.com>
Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu>
2023-07-04 20:29:32 +00:00
Yiran Wu
8236533faf update math notebook (#1095)
Co-authored-by: Chi Wang <wang.chi@microsoft.com>
2023-07-02 15:26:30 +00:00
Shaokun
7a64148676 support string alg in tune (#1093)
* support string alg in tune

* add test, enforce string feasible, support lexico in set_search_priorities in CFO

* fix bug

* fix bug

* fix bug

* fix bug

* fix bugs

* fix yiran

---------

Co-authored-by: “skzhang1” <“shaokunzhang529@gmail.com”>
2023-07-01 03:01:14 +00:00
Li Jiang
aa05434c87 temp solution for joblib 1.3.0 issue (#1100)
* temp solution for joblib 1.3.0 issue, no need once https://github.com/joblib/joblib-spark/pull/48 is merged

* update option
2023-06-30 14:17:55 +00:00
Yiran Wu
e3ca95bf8a An agent implementation of MathChat (#1090)
* mathcaht implementation

* code forrmat

* update readme

* update openai.yml

* update openai.yml

* update openai.yml
2023-06-25 13:49:34 +00:00
EgorKraevTransferwise
5245efbd2c Factor out time series-related functionality into a time series Task object (#989)
* Refactor into automl subpackage

Moved some of the packages into an automl subpackage to tidy before the
task-based refactor. This is in response to discussions with the group
and a comment on the first task-based PR.

Only changes here are moving subpackages and modules into the new
automl, fixing imports to work with this structure and fixing some
dependencies in setup.py.

* Fix doc building post automl subpackage refactor

* Fix broken links in website post automl subpackage refactor

* Fix broken links in website post automl subpackage refactor

* Remove vw from test deps as this is breaking the build

* Move default back to the top-level

I'd moved this to automl as that's where it's used internally, but had
missed that this is actually part of the public interface so makes sense
to live where it was.

* Re-add top level modules with deprecation warnings

flaml.data, flaml.ml and flaml.model are re-added to the top level,
being re-exported from flaml.automl for backwards compatability. Adding
a deprecation warning so that we can have a planned removal later.

* Fix model.py line-endings

* WIP

* WIP - Notes below

Got to the point where the methods from AutoML are pulled to
GenericTask. Started removing private markers and removing the passing
of automl to these methods. Done with decide_split_type, started on
prepare_data. Need to do the others after

* Re-add generic_task

* Most of the merge done, test_forecast_automl fit succeeds, fails at predict()

* Remaining fixes - test_forecast.py passes

* Comment out holidays-related code as it's not currently used

* Further holidays cleanup

* Fix imports in a test

* tidy up validate_data in time series task

* Test fixes

* Fix tests: add Task.__str__

* Fix tests: test for ray.ObjectRef

* Hotwire TS_Sklearn wrapper to fix test fail

* Attempt at test fix

* Fix test where val_pred_y is a list

* Attempt to fix remaining tests

* Push to retrigger tests

* Push to retrigger tests

* Push to retrigger tests

* Push to retrigger tests

* Remove plots from automl/test_forecast

* Remove unused data size field from Task

* Fix import for CLASSIFICATION in notebook

* Monkey patch TFT to avoid plotting, to fix tests on MacOS

* Monkey patch TFT to avoid plotting v2, to fix tests on MacOS

* Monkey patch TFT to avoid plotting v2, to fix tests on MacOS

* Fix circular import

* remove redundant code in task.py post-merge

* Fix test: set svd_solver="full" in PCA

* Update flaml/automl/data.py

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* Fix review comments

* Fix task -> str in custom learner constructor

* Remove unused CLASSIFICATION imports

* Hotwire TS_Sklearn wrapper to fix test fail by setting
optimizer_for_horizon == False

* Revert changes to the automl_classification and pin FLAML version

* Fix imports in reverted notebook

* Fix FLAML version in automl notebooks

* Fix ml.py line endings

* Fix CLASSIFICATION task import in automl_classification notebook

* Uncomment pip install in notebook and revert import

Not convinced this will work because of installing an older version of
the package into the environment in which we're running the tests, but
let's see.

* Revert c6a5dd1a0

* Fix get_classification_objective import in suggest.py

* Remove hcrystallball docs reference in TS_Sklearn

* Merge markharley:extract-task-class-from-automl into this

* Fix import, remove smooth.py

* Fix dependencies to fix TFT fail on Windows Python 3.8 and 3.9

* Add tensorboardX dependency to fix TFT fail on Windows Python 3.8 and 3.9

* Set pytorch-lightning==1.9.0 to fix  TFT fail on Windows Python 3.8 and 3.9

* Set pytorch-lightning==1.9.0 to fix  TFT fail on Windows Python 3.8 and 3.9

* Disable PCA reduction of lagged features for now, to fix svd convervence fail

* Merge flaml/main into time_series_task

* Attempt to fix formatting

* Attempt to fix formatting

* tentatively implement holt-winters-no covariates

* fix forecast method, clean class

* checking external regressors too

* update test forecast

* remove duplicated test file, re-add sarimax, search space cleanup

* Update flaml/automl/model.py

removed links. Most important one probably was: https://robjhyndman.com/hyndsight/ets-regressors/

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* prevent short series

* add docs

* First attempt at merging Holt-Winters

* Linter fix

* Add holt-winters to TimeSeriesTask.estimators

* Fix spark test fail

* Attempt to fix another spark test fail

* Attempt to fix another spark test fail

* Change Black max line length to 127

* Change Black max line length to 120

* Add logging for ARIMA params, clean up time series models inheritance

* Add more logging for missing ARIMA params

* Remove a meaningless test causing a fail, add stricter check on ARIMA params

* Fix a bug in HoltWinters

* A pointless change to hopefully trigger the on and off KeyError in ARIMA.fit()

* Fix formatting

* Attempt to fix formatting

* Attempt to fix formatting

* Attempt to fix formatting

* Attempt to fix formatting

* Add type annotations to _train_with_config() in state.py

* Add type annotations to prepare_sample_train_data() in state.py

* Add docstring for time_col argument of AutoML.fit()

* Address @sonichi's comments on PR

* Fix formatting

* Fix formatting

* Reduce test time budget

* Reduce test time budget

* Increase time budget for the test to pass

* Remove redundant imports

* Remove more redundant imports

* Minor fixes of points raised by Qingyun

* Try to fix pandas import fail

* Try to fix pandas import fail, again

* Try to fix pandas import fail, again

* Try to fix pandas import fail, again

* Try to fix pandas import fail, again

* Try to fix pandas import fail, again

* Try to fix pandas import fail, again

* Try to fix pandas import fail, again

* Try to fix pandas import fail, again

* Try to fix pandas import fail, again

* Try to fix pandas import fail, again

* Formatting fixes

* More formatting fixes

* Added test that loops over TS models to ensure coverage

* Fix formatting issues

* Fix more formatting issues

* Fix random fail in check

* Put back in tests for ARIMA predict without fit

* Put back in tests for lgbm

* Update test/test_model.py

cover dedup

* Match target length to X length in missing test

---------

Co-authored-by: Mark Harley <mark.harley@transferwise.com>
Co-authored-by: Mark Harley <mharley.code@gmail.com>
Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu>
Co-authored-by: Chi Wang <wang.chi@microsoft.com>
Co-authored-by: Andrea W <a.ruggerini@ammagamma.com>
Co-authored-by: Andrea Ruggerini <nescio.adv@gmail.com>
Co-authored-by: Egor Kraev <Egor.Kraev@tw.com>
Co-authored-by: Li Jiang <bnujli@gmail.com>
2023-06-19 11:20:32 +00:00
Chi Wang
8760631349 string to array (#1086)
* string to array

* exclude aoai
2023-06-17 13:11:22 +00:00
Ryan Sweet
e35570b656 Merge pull request #10 from microsoft/imrpoving-devs
added retry logic if throttling eventually kills a task.
2023-06-16 23:36:24 -07:00
Ryan Sweet
0ed02dfb35 added retry logic if throttling eventually kills a task. 2023-06-17 06:35:23 +00:00
Ryan Sweet
3ed78a0286 Merge pull request #9 from microsoft/imrpoving-devs
Imrpoving devs
2023-06-16 22:37:11 -07:00
Ryan Sweet
542e5634fb backed off the improvement skill 2023-06-17 05:36:12 +00:00
Ryan Sweet
40394ac126 fixed - added max retry parameter, increased max timeout. 2023-06-17 05:25:24 +00:00
Ryan Sweet
dc83612d91 BROKEN - trying to add more options to the cli and cannot understand how System.Commandline binding is working/not working. 2023-06-17 05:09:11 +00:00
Ryan Sweet
521b30c6ba Merge pull request #8 from microsoft/tuning-skills
Feed README into Dev Lead when chaining
2023-06-16 19:39:06 -07:00
Ryan Sweet
c6f71ed0f5 Merge branch 'main' into tuning-skills 2023-06-16 19:38:54 -07:00
Ryan Sweet
1858d8ab39 have dev plan use README as input 2023-06-17 02:31:49 +00:00
Ryan Sweet
a8cc580fa3 Update Program.cs
add more http retries
2023-06-16 12:30:13 -07:00
Ryan Sweet
3d0cb7490c increase http retries, add the Dev Improve skill. Experimental. 2023-06-16 19:22:55 +00:00
Ryan Sweet
fdd4422a32 Update README.md 2023-06-16 09:32:06 -07:00
Ryan Sweet
e592a1c7e2 Update README.md 2023-06-16 09:30:33 -07:00
Ryan Sweet
f8388e507b Update README.md 2023-06-16 09:29:34 -07:00
Ryan Sweet
86556f98e8 Merge pull request #6 from microsoft/gh-workflow
add envvars in all projects
2023-06-16 09:20:47 -07:00
Chi Wang
e1da7f7d68 update openai model support (#1082)
* update openai model support

* new gpt3.5

* docstr

* function_call and content may co-exist

* test function call

---------

Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu>
2023-06-16 00:58:44 +00:00
Kosta Petan
c0db556cc2 add envvars in all projects 2023-06-15 12:34:32 +00:00
Kosta Petan
edfad0898d WAF into memory (#4)
* waf-import WIP

* waf import to qdrant

* waf WIP

* add qdrant-init container

* call the seed project in the postCreateCommand script

* add envvars

* update startup script
2023-06-15 13:57:47 +02:00
Chi Wang
0b739b8c93 Links to papers (#1084)
* Links to papers

* Update website/docs/Use-Cases/Auto-Generation.md

Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu>

---------

Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu>
2023-06-15 09:56:24 +00:00
Qingyun Wu
0c7082c7bf Docmentation for agents (#1057)
* add agent notebook and documentation

* fix bug

* set flush to True when printing msg in agent

* add a math problem in agent notebook

* remove

* header

* improve notebook doc

* notebook update

* improve notebook example

* improve doc

* improve notebook doc

* improve print

* doc

* human_input_mode

* human_input_mode str

* indent

* indent

* Update flaml/autogen/agent/user_proxy_agent.py

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* Update notebook/autogen_agent.ipynb

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* Update notebook/autogen_agent.ipynb

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* Update notebook/autogen_agent.ipynb

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* add agent doc

* del old files

* remove chat

* agent doc

* remove chat_agent

* naming

* improve documentation

* wording

* improve agent doc

* wording

* general auto reply

* update agent doc

* human input mode

* add agent figure

* update agent figure

* update agent example figure

* update code example

* extensibility of UserProxyAgent

---------

Co-authored-by: Chi Wang <wang.chi@microsoft.com>
2023-06-14 23:56:13 +00:00
Li Jiang
3874a429cf fix workflow (#1071) 2023-06-14 06:44:30 +00:00
Qingyun Wu
9356a92ba5 add pands requirement in benchmark option (#1070)
Co-authored-by: Li Jiang <bnujli@gmail.com>
2023-06-13 08:25:41 +00:00
Chi Wang
c5dfb03f0e encode timeout msg in bytes (#1078)
* encode timeout msg in bytes

* fix msg and test
2023-06-12 18:07:14 +00:00
Kosta Petan
9f082784c7 devcontainer with cosmos + qdrant (#3) 2023-06-12 11:58:09 +02:00
Chi Wang
a30d198530 Fix documentation (#1075)
* Fix indentation in documentation

* newline

* version
2023-06-11 01:03:49 +00:00
Ryan Sweet
90a527f65f Merge pull request #2 from microsoft/refactor_skills
Skills refactor, CLI calls local kernel
2023-06-09 12:58:38 -07:00
Kosta Petan
45983b9366 cli calls local skills, instead of http function call 2023-06-09 21:38:21 +02:00
Kosta Petan
f8485dc011 switch to lib skills 2023-06-09 21:05:01 +02:00
Chi Wang
5387a0a607 Agent notebook example with human feedback; Support shell command and multiple code blocks; Improve the system message for assistant agent; Improve utility functions for config lists; reuse docker image (#1056)
* add agent notebook and documentation

* fix bug

* set flush to True when printing msg in agent

* add a math problem in agent notebook

* remove

* header

* improve notebook doc

* notebook update

* improve notebook example

* improve doc

* agent notebook example with user feedback

* log

* log

* improve notebook doc

* improve print

* doc

* human_input_mode

* human_input_mode str

* indent

* indent

* Update flaml/autogen/agent/user_proxy_agent.py

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* shell command and multiple code blocks

* Update notebook/autogen_agent.ipynb

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* Update notebook/autogen_agent.ipynb

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* Update notebook/autogen_agent.ipynb

Co-authored-by: Chi Wang <wang.chi@microsoft.com>

* coding agent

* math notebook

* renaming and doc format

* typo

* infer lang

* sh

* docker

* docker

* reset consecutive autoreply counter

* fix explanation

* paper talk

* human feedback

* web info

* rename test

* config list explanation

* link to blogpost

* installation

* homepage features

* features

* features

* rename agent

* remove notebook

* notebook test

* docker command

* notebook update

* lang -> cmd

* notebook

* make it work for gpt-3.5

* return full log

* quote

* docker

* docker

* docker

* docker

* docker

* docker image list

* notebook

* notebook

* use_docker

* use_docker

* use_docker

* doc

* agent

* doc

* abs path

* pandas

* docker

* reuse docker image

* context window

* news

* print format

* pyspark version in py3.8

* pyspark in py3.8

* pyspark and ray

* quote

* pyspark

* pyspark

* pyspark

---------

Co-authored-by: Qingyun Wu <qingyun.wu@psu.edu>
2023-06-09 18:40:04 +00:00
Kosta Petan
943222bcbf extract skills to a classlib WIP 2023-06-09 17:10:41 +02:00
Kosta Petan
1fcac0179d Chaining the whole team (#1)
* do it all :)

* save implementation as it comes

* add sandbox skill

* generate files via the sandbox skill in output/src

---------

Co-authored-by: Kosta Petan <kosta.petan@microsoft.com>
2023-06-08 17:27:11 +02:00
Kosta Petan
0b920fcdfd parse output from skills 2023-06-07 16:57:45 +02:00
Kosta Petan
da38ad5dda implement cli commands 2023-06-02 16:33:22 +02:00
Kosta Petan
572857e3a1 add gitignore, remove obj and bin artifacts 2023-06-02 12:31:09 +02:00
Ryan Sweet
0481c1dffb initial commit 2023-06-01 10:58:29 -07:00
Li Jiang
d36b2afe7f suppress warning message of pandas_on_spark to_spark (#1058) 2023-06-01 16:04:01 +00:00
Ryan Sweet
f3a7e41312 Update README.md 2023-05-31 08:01:14 -07:00
Microsoft Open Source
fbafbfe9d6 LICENSE-CODE committed 2023-05-31 07:52:25 -07:00
Microsoft Open Source
aae493b495 SECURITY.md committed 2023-05-31 07:52:21 -07:00