NashTech Insights

Data Solutions

Data Governance, Data Quality, Data Ingestion, Data processing, Data pipelines, Data Mesh, Lake, Warehouse



A Quick demo: ArangoDB to Spark to BigQuery

Hi folks! In this blog, we will learn how to integrate Spark with ArangoDB and BigQuery to build a simple ETL pipeline. ArangoDB: ArangoDB is a multi-model database system. It supports three data models (graphs, JSON documents, key/value) with one database core and a unified query language, AQL (ArangoDB Query Language). Apache Spark: Apache Spark is an open-source, distributed processing engine used for big data …


Data Mesh Architecture with Azure and Databricks

Data Mesh is a new approach to data architecture and governance. Data Mesh and its principles bring a paradigm shift in how organizations approach data management, promoting autonomy, collaboration, and scalability. It empowers cross-functional teams to own and manage their data domains in a decentralized way. It ensures data quality and consistency across the organization’s data assets for …


Data mesh – decentralized analytics with Data mesh in Education

What is a Data mesh? Data mesh is a new architectural approach coined by Zhamak Dehghani that emphasizes the decentralization of data ownership, governance, and infrastructure. The idea of Data Mesh was initially introduced in 2019, and its maturity has been evolving since 2021. It responds to the limitations of traditional centralized data warehouses, which …


Building Snowflake CI/CD Pipeline using Azure DevOps

Key takeaways: A CI/CD pipeline plays a crucial role in most software projects, and data projects are no exception. By automating the deployment of objects such as tables, views, tasks, stored procedures, and functions, it significantly reduces administrative burden and cycle times. In this blog, I will provide you step by step …



Data Mesh – What it brings to business?

Data Mesh is not a framework but a paradigm for designing and operating data architectures. This paradigm improves the manageability and scalability of data within your enterprise. Given the maturity of data solutions in large organisations, they all already have centralised data in place. But still, if there is a need for …



For dashboard apps – JSON or CSV?

Intro: The choice between JSON (JavaScript Object Notation) and CSV (Comma-Separated Values) as data formats for APIs that feed dashboard applications is a crucial decision for developers. There are distinct advantages and trade-offs for both, impacting aspects such as data representation, ease of use, flexibility, and performance. I’ve come across this decision point many times …



Importance of Data Quality: Ensuring Accuracy and Reliability

(The above image is courtesy of https://dfg-qa.ship-med.uni-greifswald.de/index.html; we refer to it for demonstration purposes only.) In today’s data-driven world, businesses rely heavily on data to make informed decisions, improve operations, and gain a competitive edge. However, the value of data is only as good as its quality. Poor data quality can lead to incorrect insights, missed …



Protecting Your Business: The Importance of Data Security 

Introduction: With the increasing use of digital technology in our daily lives, data has become one of the most valuable assets in the world. However, with this value comes the risk of data breaches and cyber attacks, which can have severe consequences for businesses. In this blog post, we will explore the importance of data …



Mastering Word Embedding and Word2Vec

Word Embedding and Word2Vec have emerged as powerful techniques for understanding and representing the relationships between words. In this article, we will embark on a journey of exploration and implementation, diving into the world of Word Embedding and Word2Vec. By delving into their fundamentals and practical applications, we will uncover how these techniques revolutionize language …


Practice to set up a Test Environment

In the software testing lifecycle, we need to set up and manage the Test Environment to support accurate and reliable testing activities. It includes environment provisioning, test configuration, test data management, test devices and test maintenance. In this blog, I would like to share some practices to set up a test environment in software testing: Cloud-Based Test …
