FIGURE 6 | Airbnb NYC cluster mean analysis
Topic
Team assignment to analyse, visualise, and report on Airbnb New York City (NYC) 2019 dataset (Kaggle, 2021).
Outcomes
What (2, 4)
Collaboratively delivered EDA, pre-processing, statistical analysis, data visualisation, and unsupervised machine learning (ML) using k-means clustering (Oluleye, 2023). Teamwork split into project management, coding, merging, and report writing.
So What (2, 4)
Figure 6 shows cluster-specific strategies are key to optimising revenue; however, regulation and saturation need to be considered. With a team new to ML, coding and report writing were iterative. Due to being virtual across time zones, document sharing used Google Drive, planning used Trello, and discussion used WhatsApp and Google Meet. Coding and report writing followed EDA within Cross Industry Standard Process for Data Mining (CRISP-DM) (Niakšu, 2015, Mukhiya & Ahmed, 2020; Oluleye, 2023).
Now What (2,4)
As our first practical machine learning coding, there was a steep learning curve for all. Subsequent assignments, such as feature reduction techniques, illustrate steps that could be taken with k-means clustering. Future development requires a deeper understanding of other machine learning techniques. Further experience with GitHub identified mistakes to be avoided in future collaborations.
Skills
EDA, k-means clustering, Matplotlib, Numpy, Pandas, Python, Scikit-Learn, Seaborn.
Feedback
Distinction. Tutor calls it an "impressive submission" that is clearly articulated with a thorough EDA and practical and actionable specific recommendations. Very positive team feedback: all felt this group would work well professionally.