Injulkar Nilesh
Code. Read. Trek.
Featured
Object Oriented JavaScript
This post has been migrated from injulkarnilesh.blogspot.com. Though it is the most widely used programming language on the Internet, JavaScript is a much misunderstood and weird programming language. JavaScript is a loosely typed, object-oriented, functional programming language where almost everything is an object. Types in JavaScript JavaScript has three primitive data types: Number String...
Building Word Cloud of fake news debunked by Alt News for learning Python
Alt News is an Indian fake-news debunking site run by a fairly liberal group of people. Thanks to the high penetration of Facebook, Twitter, and WhatsApp, cheap data rates, and non-professional mainstream media, India has become a hotbed of fake news. These platforms are exploited for every possible purpose you can name,...
Regular
Personal Elastic Search Cheat Sheet
Elastic Search is great but the API syntax is a little weird. So, I noted down some important APIs I came across in the book Elasticsearch: The Definitive Guide.
How Elastic Search Searches
Elastic Search is a horizontally scalable search engine with support for near-real-time search. Here, we will go over how Elastic Search does it.
Running Self Contained PySpark Package On Yarn Cluster
Python packaging Being new to Python and PySpark, and having to test PySpark feasibility on an old Hortonworks Data Platform (HDP) cluster, I had many questions. Having worked with Java and Spark, I was expecting a similar workflow for running the PySpark application on the cluster. I assumed there would...
Word count the Spark way
Problem Statement: Given a text file, output every word that appears in the file and how many times it appears.
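To make the problem statement concrete, here is a minimal sketch in plain Python rather than Spark. The function name `count_words`, the sample text, and the tokenization rule (lowercase everything, treat runs of letters and apostrophes as words) are assumptions for illustration, not taken from the post.

```python
import re
from collections import Counter


def count_words(text):
    """Return a Counter mapping each lowercased word to its number of occurrences."""
    # Assumption: a "word" is a run of letters or apostrophes, compared case-insensitively.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words)


# Hypothetical sample input standing in for the contents of a text file.
sample = "the quick fox and the lazy dog and the cat"
counts = count_words(sample)

# Print each word with its count, most frequent first.
for word, n in counts.most_common():
    print(word, n)
```

For a real file, the text could be read with `open(path).read()` and passed to `count_words`. A Spark version would distribute the same split, map, and count steps across a cluster.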
Understanding Spark Architecture
Spark is a distributed data processing framework used in the Big Data world to process large amounts of data in parallel across a cluster, and therefore faster.