The Overview
STIS Big Data is a project to provide a portfolio of Big Data researches that are conducted by teams consisting of college students, lecturers and researcher staffs of STIS Polytechnic of Statistics. The research conducted by the college students are parts of the academic activities held in STIS Polytechnic of Statistics which is conducting research and development of knowledge and technologies to support statistical activities in national and international level. Founded in early 2019, our vision is to be a part of big data analysis world.
The Technology
At the lower levels, the programming usually written in Python
and R
. Python
is used more broadly for instance programming the web scraper to collect data when needed, preprocess and analyze data, build machine learning model from the data, etc. We store all the data collected in BigQuery
which is a serverless, highly-scalable, and cost-effective cloud data warehouse. Because there is no infrastructure to manage, we can focus on deeper analysis using SQL like language that we all love.
We firmly believe in the Don't Repeat Yourself principle which means that if a such robust tool or framework exists, we will use it except our needs exceed its abilities
We use Business Intelligence tool such as Tableau
or Power BI
to help us process large amounts of unstructured data from internal and external systems which then changed into reports, data visualizations, and/or dashboards. The results of each research are subsequently published on this website for public viewing.
Programming Language: Python
, R
, NodeJS
Web Server: Nginx
Web Dashboard CMS: Ghost
Database: BigQuery
, MySQL
Business Intelligence: Tableau
, Power BI
DevOps: GitLab
The Team
We are researchers, curious to utilize the Big Data to reveal the social and economic phenomena in our country. We have been initiating and performing some projects regarding it to support official statistics production conducted by BPS (Statistics Indonesia). The results of our studies are published on our website. Our team consists of data engineers, data analysts, and project manager. Click the link below to know more about us