The Data Community As A Service
Last Updated on July 25, 2023 by Editorial Team
Author(s): Gift Ojeabulu
Originally published on Towards AI.
An opinionated perspective on data communities, their importance, benefits, and how community members are gaining from it.
Why this article?
I wrote this article to educate and make data enthusiasts and practitioners see the importance of community to individuals in the data space worldwide.
Alone, we can do so little; together, we can do so much β Helen Keller.
Our world is connected and increasingly disillusioned with traditional career paths and traditional education. In such a world, is it any surprise that people put in extra effort on access to networking in the data field and learning experiences?
What is the best way to attain those experiences reliably and at a low cost?
The answer is D.C.A.A.S β Data-Community-As-A-Service.
Data community-as-a-service is the phenomenon where access to a group of data practitioners is valuable enough that we can consider it a mutual relationship between the data community and data practitioners.
For whom is this article?
- Technical and non-technical data professionals.
- Data enthusiasts who are interested in joining data communities to connect with other data practitioners and to get a clear path in their data careers.
- Data professionals looking for the best data communities to join with like minds.
What you will learn?
- Why Data Community as a service?
- What is a data community?
- The benefit of belonging to data communities as a data enthusiast or practitioner.
- List of data communities for data enthusiasts, and professionals, and their specific impact.
Why Data community As A Service?
The community itself defines humans, A conglomeration of communities βforms states, countries, and continents.
The developer community is not an exception.
The rate at which people are getting into the tech industry in different parts of the world keeps rising every day.
Developers in this context are data practitioners since Iβm talking about the data community as a service.
As data grows, the need for other data careers keeps growing since data is everywhere and information is scarce.
Data Community as a service is a concept that allows members of the data community to learn and share ideas while maintaining guardrails and parameters that allow the data community to focus on a greater purpose for the betterment of all, the success of this concept relies on the trusted status of the community, so its members feel comfortable sharing and engaging in the community-Gift Ojeabulu
As the data careers keep growing, there is a need for building data communities, since humans are naturally gregarious, I.e. humans are social.
Human beings are social animals and therefore naturally seek the companionship of others as part of their well being-Aristotle
What is a data community?
We can simply define the data community as a group of technical and non-technical people i.e Data Scientists, Data Engineers, Data Analysts, Data product managers, data community managers, and so on.
We encapsulate them under an umbrella called data practitioners that have certain attitudes or interests in common, which includes three key elements:
- Community participation β Answering, asking questions, and supporting each other.
- Organization β Organizing meetups, hackathons & Conferences.
- Work β Building, contributing to open source projects, & providing data solutions at their workplace.
From an organizationβs perspective, a data community can be valuable for marketing and sales initiatives but the goal should be a focus on providing value to data practitioners in your community foremost, to ensure that data practitioners find value in the community worth their time, it will pay off in the long run β in summary, it is all about genuine relationship building.
Benefits of joining a community as a data scientist or machine learning engineer
In this context, A Data scientist or a Machine learning engineer represents the generic name for a data practitioner.
I use this because that is the closest profession to what I do.
If you are a data engineer, replace it as a benefit of joining a community as a data engineer.
- Direction.
- Networking & opportunity.
- Broadens your knowledge.
- Support, motivation, and persistency
- Direction -It directs you rightly. The Data field is full of complexities, especially for beginners. Most times, you get started and need clarification. So joining a community directs you.
You can relate and ask experienced people, then decide where to focus. You canβt learn everything in the data science field though matter how smart you are.
But getting directed on the right path, and learning what you need to learn to land a job, is very important. Since your implicit reason is to make money.
Note: A lot of beginners complain they need a mentor.
As a beginner, have you ever considered staying active in communities and asking questions when things go wrong?
Mentorship doesnβt have to be direct. A community can fill up that gap.
2. Networking opportunity β When you join a community, you βmeet different people, and there are a lot of job opportunities in the community. Many get referrals that land them their dream job just by joining a community.
3. It expands your area of expertise -When you join a community; you notice what is happening.
It exposed you to discussions about data science-related topics. You see people giving answers to what you are struggling with.
It builds and improves you unconsciously.
4. For support, motivation & persistence β The journey can be very tough. Sometimes you discover you just keep procrastinating because of a lack of support and motivation, but when you join a community, seeing people building and learning motivates you not to give up.
My Advice: Join communities and stay active
List of top data communities for data enthusiasts, professionals, and their specific impact.
These are some lists of the data communities I have found helpful in the data space from personal experience, research, online reviews, and conversations with members of the community.
The DataTalksClub community is a global online community of data enthusiasts, Data/product analysts, data scientists, Machine learning engineers, data engineers, analytics engineers, developer advocates, and software engineer / Python developers led by Alexey Grigorev.
It is a place to talk about data, machine learning, and engineering, attend weekly events and learn from free courses, ask career questions and
discuss career options.
DataTalksClub has some amazing resources like MLOps Zoomcamp, Data Engineering Zoomcamp, and ML zoom camp.
Community guidelines, Slack channel
The MLOps Community Led by Demetrios Brinkmann fills the swiftly growing need to share real-world Machine Learning Operationsβ best practices from engineers in the field.
The MLOps community needed a community laser-focused on solving the unique daily challenges in building production AI/ML pipelines.
The MLOps community is in 18 countries with over 5.2k members.
Examples of some MLOps local meetups are:
- MLOps community Lagos organized by Gift Ojeabulu, Stephen Oladele, and Tochukwu Nwoke
- MLOps community in Toronto.
All Meetup Link, Slack channel
DataFestAfrica, now Data community Africa, started as a discussion of 4 data practitioners (Gift Ojeabulu, David Abu, Oyinbooke Olarenwaju, and Opeyemi Fabiyi) who saw the need to create a community for all data-related careers in Africa.
It is the first-ever event and community that showcases and brings all career professions within the data space together in Africa.
If you are a Data Analyst, Data Scientist, Data Engineer, AI/ML Engineer, Low Code Developer, student, researcher, Data Manager, Data Consumer, or recruiter within this space, then Data Community Africa is for you.
Data Community Africa has built the data community in Africa through holding solution-based Hackathon, Conference, and interactive online community that has a track record of helping data enthusiasts and professionals.
Online Community
Kaggle, a subsidiary of Google LLC, is an online community of data scientists and machine learning practitioners. Kaggle allows users to find and publish data sets, explore and build models in a web-based data-science environment, work with other data scientists and machine learning engineers, and enter competitions to solve data science challenges.
Kaggle got its start in 2010 by offering machine learning competitions and now also offers a public data platform, a cloud-based workbench for data science, and Artificial Intelligence education.
Cohere For AI is a non-profit research lab led by Sarah Hooker that seeks to solve complex machine-learning problems. This community supports fundamental research that explores the unknown and is focused on creating more points of entry into machine learning research.
This community is a space where researchers, engineers, linguists, social scientists, and lifelong learners connect and collaborate. Individuals come together from all over the world and welcome you, whether you are a mentor, dropout, just getting started, PhD, masterβs, undergraduate, unaffiliated, industry, academic, or not sure.
The Cohere for AI community is excited to support community-driven research and to be shaped by membersβ interests.
DataTech Space (DTS) is a free online community led by Jessica Uwoghiren exclusive to individuals with an interest in Data-related fields (i.e. Data Analytics, Data Science, Data Engineering, Machine Learning, etc.) Regardless of their skill level.
This community provided a support system for budding data enthusiasts and connected them with their peers and experienced professionals.
This community has 2 levels, the telegram and discord channel.
Young Data Professionals (YDP) is a tribe and vibrant community of smart, young, and energetic data lovers across different levels of expertise led by Opeyemi Fabiyi.
YDP engages in robust discussions on data and proffers solutions to membersβ challenging tasks. YDP also help community members sharpen their skills towards staying updated with the global trends in the Analytics space.
YDP community is the driver of dbt Lagos Nigeria U+1F1F3U+1F1EC Meetup and leads the awareness of Analytics Engineering in the African data ecosystem.
DBT is one of the most popular open-source tools right now. The online Slack community for dbt boasts one of the largest hubs of conversation for Analytics Engineering, with over 22,700 members in total!
While the conversation is typically related to dbt and the tooling that connects to it, some of the best minds in data are actively sharing their thoughts here.
It doesnβt hurt that the community itself is one of the most welcoming, heartwarming groups out there.
The Hugging Face community led by Omar Sansevero is on a journey to advance good Machine Learning and make it more accessible. Along the way, hugging face contributes to the development of technology for the better.
Hugging Face has a thriving open-source ecosystem with all backgrounds (from ML Researchers to students interested in model biases). With over a dozen open-source projects, an amazing community, and thousands of contributors and collaborators.
Thousands of creators work as a community to solve audio, vision, and language with AI.
Discord community, Forum, Classroom
Masakhane is a grassroots organization whose mission is to strengthen and spur NLP research in African languages, for Africans, by Africans.
Zindi hosts the largest community of African data scientists, working to solve the worldβs most pressing challenges using machine learning and AI.
Zindi connects data scientists with organizations and provides a place to learn, hone your skills, and find a job.
Zindi aims to transform the African continent and showcase African data science talent to the world.
Other amazing data communities:
References
[1] Jovian Cicmil, https://medium.com/swlh/community-as-a-service-a-business-model-for-the-21st-century-b7e0612e7095 (2021)
And we draw into conclusion hereβ¦
Attending tech events, and joining & staying active in a data community, are one of the best experiences in the data space as an experienced or novice data practitioner.
Congratulations! You now have an excellent knowledge of the benefit of belonging to data communities as a data enthusiast or practitioner, data community as a service, and active data communities for data enthusiasts and professionals.
I canβt wait to see you adopt what I explained in this article and start making waves with them in the data space.
Connect with me on LinkedIn
Connect with me on Twitter
Join thousands of data leaders on the AI newsletter. Join over 80,000 subscribers and keep up to date with the latest developments in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming aΒ sponsor.
Published via Towards AI