Data Cleaning 101

Data Cleaning 101. Data quality is crucial aspect and centre of attraction for any data science project.

Research Data Strategy

Framework and motivating factors. Research organizations can benefit a lot from this, and this article aims to introduce its scientific variant: “Research Data Strategy” (RDS).

Excel’s Limitation Caused Loss of 16,000 Positive COVID Cases

Excel’s Limitation Caused Loss of 16,000 Positive COVID Cases. And 50,000 potentially infectious people missed by tracers and not told to self-isolate

Managing Data as a Data Engineer — Understanding Users

Understanding how users view data and their pain points when using data. In this article, I would like to share some of the things that I have learnt while managing terabytes of data in a fintech company.

Now is the best time to be a Data Scientist or a Data Steward in Europe

Now is the best time to be a Data Scientist or a Data Steward in Europe. Why the European Union plans to train over half a million data scientists and data stewards

Why Data Management remains a challenge in the Data and AI-first era

Why Data Management remains a challenge in the Data and AI-first era. What challenges companies face with data management and how to begin tackling them

Managing Data as a Data Engineer:  Understanding Data Changes

Understand how data changes in a fast growing company makes working with data challenging. In the last article, we looked at how users view data and the challenges they face while using data.

Graph Databases for the Public Sector

Graph databases also provide a way for government to track and analyze social media to help with law and order, for example, or to target crime ...

Introduction to Schema: A Python Libary to Validate your Data

Introduction to Schema: A Python Libary to Validate your Data. We can do that with schema. This article will show you how to use schema in a variety of scenarios.

How to improve data quality for Machine Learning?

How to improve data quality for Machine Learning? The ultimate goal of every data scientist or Machine Learning evangelist is to create a better model with higher predictive accuracy.

If You Work in “Small Science,” Are You Leveraging Data Repositories?

If You Work in “Small Science,” Are You Leveraging Data Repositories? Data repositories can help scientists with minimal resources make their work findable and citable.

Data — the starting point of a Data Science journey

Data — the starting point of a Data Science journey. Why is data important? What types of data are there? How we can use data in data science predictions?

CRD is just a table in Kubernetes

CRD, Custom Resource Definition is a special resource in Kubernetes. When you use Kubernetes in regular way, you don’t have to create that kind of resource. So this is not so important for many users. But sometimes, CRD appears in cutting edge blog posts, documents in kubernetes.io and discussion.

What data resources are used by Fintech companies?

To improve the performance of any organization the role of data is integral. An ability to process the right data by Fintech companies provides them a huge competitive advantage .

Why you need to treat AI models like data

Why treating models like data is a very strategic approach. Here is a very abstract question — What does an AI or data science model look like? We are all using data science models in our day to day life.

Data Validation: Key Solution for Big Data Management Challenges

In this article, we will go over key statistics highlighting the main data validation issues that currently impact big data companies.