International Workshop on Serverless Machine Learning for Intelligent and Scalable AI Workflow

ServerlessAI 2021


Automation & Control Theory



With several innovations emerging in the domain of AI Automation, the next wave of automation will be focused on an AI applications. The design and development of AI workflows is at the core of emerging AI applications. AI workflows are “dataset centric”, with characteristics and quality of dataset varying across industries and applications. For example, it is common to have “big noisy data” in Oil and Gas industries as there are many oils pumps installed across a wide geographical space. On the other hand, the data generated by imaging technology is “wide clean data” (i.e., fewer number of records, but with a very high number of attributes). Researchers have introduced purpose-built programming models and pipelines APIs that allowing end users to construct a “Pipeline Graphs” to create an AI workflow for Automated Model Discovery. These programming interfaces support multiple machine learning ecosystems and frameworks, e.g., scikit-learn, Keras, pyearth, XGBoost, as part of the same pipeline graph definition, and new pipeline API such as CodeFlare Pipelines. Moreover, using Pipeline Graph we can specify multiple machine learning tasks such as Classification, Regression, Imputation, Time Series Forecasting, Imbalance Learning, Data Sampling, etc. In the race of getting the state-of-the-art result, the data scientists construct a very large Pipeline Graph. Typically, the size of Pipeline Graph varies across applications, across different AI tasks, or even across different personas. In summary, Execution of Pipeline Graph generates bursty workload and execution on Pipeline graph is also adhoc.
With emerging serverless platform offerings emerging, for example IBM Cloud Function and Code Engine, there are new opportunities to build a serverless machine learning toolkit to support the seamless execution of Pipeline Graphs as well as other common operations that can be scaled out. The on-demand capability of spinning up resources on Cloud with negligible instantiation using serverless technology is the center of attraction for AI workload. The focus on this workshop is to introduce serverless technology along with how it is leveraged to build next generation reusable serverless machine learning toolkit to be used for various AI Applications.
Research topics included in the workshop but not limited to the following
· AI application demonstration using serverless technology
· Design, Development and API extension for popular ML library such as sklearn to natively support serverless
· Experimental Analysis of Serverless vs traditional pre-configured
· Workflow manager design
· Scalability and fault tolerance
· Automated ML using serverless
· Feature engineering
· Benchmark papers including emerging technology such as Ray
· Explanability scale out
https://wi-lab.com/cyberchair/2021/bigdata21/index.php