In this blog We will be analyzing Apache web server log files and extract the HTTP request code information from log files and get the total number of responses for each status code. First we need to import the required packages. Pyspark is an interface for Apache Spark in Python. Pyspark isn't on path by... Continue Reading →