-
-
Notifications
You must be signed in to change notification settings - Fork 65
Labels
help wantedOpen to participation from the communityOpen to participation from the community✨ goal: improvementImprovement to an existing featureImprovement to an existing feature🏁 status: ready for workReady for workReady for work💻 aspect: codeConcerns the software code in the repositoryConcerns the software code in the repository🟩 priority: lowLow priority and doesn't need to be rushedLow priority and doesn't need to be rushed
Description
Problem
The wikipedia data is being fetched using the wikipedia_fetch.py. Wikipedia mainly uses the CC_BY_SA 4.0 license and it api currently fetches data from all language edition of wikipedia.
There is need to complete the processing and reporting of the language data.
Description
Need to identify meaningful analysis like:
- Top 10 highest language usage
- Classifying represented and underrepresented languages
- Average count of article per language
- % of all Wikipedia articles that belong to the top 10 languages
- % of underrepresented languages
- Classify article count by regions
Alternatives
Can we use other visualizations for reporting?
Implementation
- I would be interested in implementing this feature.
Metadata
Metadata
Assignees
Labels
help wantedOpen to participation from the communityOpen to participation from the community✨ goal: improvementImprovement to an existing featureImprovement to an existing feature🏁 status: ready for workReady for workReady for work💻 aspect: codeConcerns the software code in the repositoryConcerns the software code in the repository🟩 priority: lowLow priority and doesn't need to be rushedLow priority and doesn't need to be rushed
Type
Projects
Status
Backlog