Leveraging Small Language Models in NuNet Containers for Efficient ML Deployments
Estimation
Story points: 30 SP
Estimated focus duration (perfect conditions): 30 days
Estimated pessimistic duration (worst case scenario): 60 days
Description
Click to expand
Who
- @avimanyu786
- @ALISHAHMUHAMMAD - collaborator for task 4
What
- The integration of Small Language Models (SLMs), specifically the phi-2 model (license under the MIT License), into NuNet containers to enhance machine learning deployments with focused accuracy, minimized resources, and enhanced privacy.
How
- Conduct a feasibility study on SLM integration into NuNet containers, focusing on the phi-2 model.
- Optimize NuNet containers to support the deployment and efficient operation of SLMs.
- Benchmark the performance and resource utilization of SLMs within NuNet containers against traditional models.
- Engage with the ML/AI community for feedback on SLM integration into NuNet containers.
- Develop comprehensive documentation and guidelines for SLM deployment in NuNet containers.
Why
- To leverage the efficiency, performance, and privacy advantages of SLMs like phi-2, making AI more accessible and sustainable while enhancing NuNet's container ecosystem.
When
- The project kick-off is scheduled since March 26 2024.
Acceptance Criteria
Click to expand
- Successful deployment of the phi-2 model within a NuNet container, verified through comprehensive testing.
- Demonstrated efficiency and performance improvements of SLMs in NuNet containers, supported by benchmarking results.
- Documentation and guidelines for deploying SLMs in NuNet containers are complete and accessible.
- Positive feedback and engagement from the ML/AI community on the integration of SLMs into NuNet containers.
- All necessary optimizations for NuNet containers to support SLMs are identified and implemented.
Screenshots
Benchmarks
Work Breakdown Structure (WBS)
| Task | Description | Duration | Status | Start Date | End Date | Comment |
|---|---|---|---|---|---|---|
| 1 | Feasibility study for SLM integration | x Hrs | In Progress | March 26 2024 | Initial assessment and planning | |
| 2 | Optimization of NuNet containers | x Hrs | In Progress | April 11 2024 | Includes performance and privacy enhancements | |
| 3 | Benchmarking and testing | x Hrs | In Progress | April 22 2024 | Performance comparison and validation | |
| 4 | Community engagement and feedback collection | x Hrs | In Progress | April 29 2024 | Outreach and analysis of feedback | |
| 5 | Documentation and guidelines development | x Hrs | In Progress | June 3 2024 | Creation of user-friendly materials |
References
Edited by Avimanyu Bandyopadhyay




