PySpark Recipes: A Problem-Solution Approach with PySpark2
Raju Kumar Mishra
<p>Quickly find solutions to common programming problems encountered while processing big data. Content is presented in the popular problem-solution format. Look up the programming problem that you want to solve. Read the solution. Apply the solution directly in your own code. Problem solved!<br/><i>PySpark Recipes</i> covers Hadoop and its shortcomings. The architecture of Spark, PySpark, and RDDs is presented. You will learn to apply RDDs to solve day-to-day big data problems. Python and NumPy are included, making it easy for new learners of PySpark to understand and adopt the model.<br/><br/><b>What You Will Learn</b><ul><li>Understand the advanced features of PySpark2 and SparkSQL</li><li>Optimize your code</li><li>Program SparkSQL with Python</li><li>Use Spark Streaming and Spark MLlib with Python</li><li>Perform graph analysis with GraphFrames</li></ul><b>Who This Book Is For</b><br/>Data analysts, Python programmers, big data enthusiasts<br/><br/></p>
Categories:
Year:
2018
Edition:
1
Publisher:
Apress
Language:
English
Pages:
280
ISBN 10:
1484231414
ISBN 13:
9781484231418
File:
PDF, 3.19 MB