This module is meticulously designed to equip students with essential knowledge and skills required for the design, development, and implementation of data science projects in both business and research environments. Central to this module is the practical understanding of acquiring, processing, and storing real-world data within a big data framework.
Students will be adept in querying databases through application programming interfaces (APIs), utilizing common database frameworks tailored for structured and unstructured data, and handling dynamic and large-scale data effectively. The module also imparts critical knowledge in refactoring machine learning models and associated code for deployment in web-based applications.
Throughout the course, students will engage in hands-on activities that mirror real-world challenges associated with deploying machine learning models into end-to-end solutions. This practical approach encompasses the entire spectrum of data science workflows—from data acquisition and processing to the operational deployment of machine learning models.
By the end of this module, students will have gained a robust understanding of the processes, techniques, and workflows essential to delivering functional machine learning solutions. They will be equipped to autonomously plan, manage, and execute complex machine learning projects, including developing client-facing application interfaces, with a clear emphasis on practical application and industry relevance.
John, M. M., Olsson, H. H., & Bosch, J. (2021, September). Towards mlops: A framework and maturity model. In 2021 47th Euromicro Conference on Software Engineering and Advanced Applications (SEAA) (pp. 1-8). IEEE
Calefato, F., Lanubile, F., & Quaranta, L. (2022, September). A preliminary investigation of MLOps practices in GitHub. In Proceedings of the 16th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (pp. 283-288).
Mäkinen, S., Skogström, H., Laaksonen, E., & Mikkonen, T. (2021, May). Who needs MLOps: What data scientists seek to accomplish and how can MLOps help?. In 2021 IEEE/ACM 1st Workshop on AI Engineering-Software Engineering for AI (WAIN) (pp. 109-112). IEEE
Openja, M., Majidi, F., Khomh, F., Chembakottu, B., & Li, H. (2022). Studying the Practices of Deploying Machine Learning Projects on Docker. arXiv preprint arXiv:2206.00699.
Granlund, T., Kopponen, A., Stirbu, V., Myllyaho, L., & Mikkonen, T. (2021, May). Mlops challenges in multi-organization setup: Experiences from two real-world cases. In 2021 IEEE/ACM 1st Workshop on AI Engineering-Software Engineering for AI (WAIN) (pp. 82-88). IEEE
How Netflix works: the (hugely simplified) complex stuff that happens every time you hit Play (2017)
Bowles, M. (2019). Machine Learning with Spark and Python: Essential Techniques for Predictive Analytics. John Wiley & Sons.
Made with ML (2023)
SQLite vs TinyDB (2021)
What is MLOps (2019)
MLOps Best Practices for Machine Learning Model Development, Deployment, and Maintenance (2022)
Build and Run a Docker Container for your Machine Learning Model (2021)