Injulkar Nilesh

Featured

Object Oriented JavaScript

This post has been migrated from injulkarnilesh.blogspot.com. Though it is the most widely used programming language on the Internet, JavaScript is a widely misunderstood and weird programming language. JavaScript is a loosely typed, object-oriented, functional programming language where almost everything is an object. Types in JavaScript: JavaScript has three primitive data types: Number String...

25 December 2020 | #featured #javascript #object-oriented

Building Word Cloud of fake news debunked by Alt News for learning Python

Alt News is an Indian fake-news debunking site run by a fairly liberal group of people. Thanks to the high penetration of Facebook, Twitter, and WhatsApp, cheap data rates, and unprofessional mainstream media, India has become a hotbed of fake news. These platforms are exploited for every possible purpose you can name,...

08 August 2020 | #featured #python #data

Regular

Personal Elastic Search Cheat Sheet

Elastic Search is great but the API syntax is a little weird. So, I noted down some important APIs I came across in the book Elasticsearch: The Definitive Guide.

20 October 2021 | #elastic #search

How Elastic Search Searches

Elastic Search is a horizontally scalable search engine with support for near real-time search. Here, we will go over how Elastic Search does it.

15 September 2021 | #elastic #search

Running Self Contained PySpark Package On Yarn Cluster

Python packaging: Being new to Python and PySpark, and having to test PySpark's feasibility on an old Hortonworks Data Platform (HDP) cluster, I had many questions. Having worked with Java and Spark, I was expecting a similar workflow for running the PySpark application on the cluster. I assumed there would...

30 July 2021 | #spark #python #pyspark #yarn #HDP

Word count the Spark way

Problem Statement: Given a text file, output every word that appears in the file and how many times each one appears.
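
To make the problem statement concrete, here is a minimal sketch in plain Python rather than Spark. The function names `count_words` and `count_words_in_file` are hypothetical, and treating a "word" as a lowercase run of letters, digits, or apostrophes is an assumption that the post itself may not share.

```python
import re
from collections import Counter


def count_words(text: str) -> Counter:
    """Count how many times each word appears in text, ignoring case."""
    # Assumption: a word is a run of letters, digits, or apostrophes.
    words = re.findall(r"[a-z0-9']+", text.lower())
    return Counter(words)


def count_words_in_file(path: str) -> Counter:
    """Read a UTF-8 text file and count its words."""
    with open(path, encoding="utf-8") as handle:
        return count_words(handle.read())


if __name__ == "__main__":
    sample = "the quick brown fox jumps over the lazy dog the end"
    # Print the three most frequent words with their counts.
    for word, count in count_words(sample).most_common(3):
        print(word, count)
```

This reads the whole file into memory, which is fine for small inputs; a distributed engine such as Spark becomes relevant when the text no longer fits on one machine.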

29 June 2021 | #spark #rdd #dataset #sql

Understanding Spark Architecture

Spark is a distributed data processing framework used in the Big Data world to process large amounts of data in parallel across a cluster, and therefore faster.

06 June 2021 | #spark #architecture