{"id":481,"date":"2021-02-16T12:05:05","date_gmt":"2021-02-16T04:05:05","guid":{"rendered":"https:\/\/blog.frost-s.tk\/?p=481"},"modified":"2021-02-16T12:05:06","modified_gmt":"2021-02-16T04:05:06","slug":"spark-sql-%e5%86%85%e6%a0%b8%e5%88%86%e6%9e%90%ef%bc%88%e4%b8%80%ef%bc%89","status":"publish","type":"post","link":"https:\/\/blog.frost-s.com\/index.php\/2021\/02\/16\/spark-sql-%e5%86%85%e6%a0%b8%e5%88%86%e6%9e%90%ef%bc%88%e4%b8%80%ef%bc%89\/","title":{"rendered":"spark SQL \u5185\u6838\u5206\u6790\uff08\u4e00\uff09"},"content":{"rendered":"\n<p>\u672c\u6587\u7ae0\u7b80\u8981\u6982\u8ff0\u4e86spark sql \u7684\u6267\u884c\u6d41\u7a0b\u4ee5\u53ca\u57fa\u672c\u539f\u7406\u3002<\/p>\n\n\n\n<p>spark sql \u672c\u8d28\u662f\u5c06sql \u8bed\u53e5\u89e3\u6790\u4e3a\u5bf9\u5e94\u7684RDD\u6a21\u578b\u6765\u8fdb\u884c\u6267\u884cspark \u76f8\u5173\u7684\u8ba1\u7b97\u64cd\u4f5c\u3002<\/p>\n\n\n\n<p>\u5728spark \u4e2d\uff0cRDD\u4e2d\u7684\u7a84\u4f9d\u8d56\u662f\u7c7b\u4f3c\u4e8epipeline \u6765\u8fdb\u884c\u6267\u884c\u64cd\u4f5c\u7684\uff0c\u5bbd\u4f9d\u8d56\u662f\u9700\u8981\u5728\u4e0d\u540c\u7684\u8282\u70b9\u8fdb\u884cshuffle\u64cd\u4f5c\u3002<\/p>\n\n\n\n<p>\u603b\u7684\u6765\u8bf4\uff0cRDD \u80fd\u5f88\u65b9\u4fbf\u7684\u652f\u6301MapReduce\u5e94\u7528\u3001\u5173\u7cfb\u578b\u6570\u636e\u5904\u7406\u3001\u6d41\u5f0f\u6570\u636e\u5904\u7406\u3001\u4ee5\u53ca\u8fed\u4ee3\u578b\u7684\u5e94\u7528\u3002<\/p>\n\n\n\n<p>\u5728spark 2.0\u4e2dDataset \u6210\u4e3a\u4e86spark\u4e2d\u4e3b\u8981\u7684API\uff0c\u7ed3\u5408\u4e86RDD\u4ee5\u53caDataFrame\u7684\u7279\u70b9\uff0c\u5c5e\u4e8espark\u7684\u9ad8\u7ea7API\u3002<\/p>\n\n\n\n<p>spark sql \u7b80\u8981\u6267\u884c\u6d41\u7a0b\u8fc7\u7a0b\u6982\u89c8\uff1a<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><div class='fancybox-wrapper lazyload-container-unload' data-fancybox='post-images' href='https:\/\/blog.frost-s.com\/wp-content\/uploads\/2021\/02\/image-662x377.png'><img class=\"lazyload lazyload-style-2\" src=\"data:image\/svg+xml;base64,PCEtLUFyZ29uTG9hZGluZy0tPgo8c3ZnIHdpZHRoPSIxIiBoZWlnaHQ9IjEiIHhtbG5zPSJodHRwOi8vd3d3LnczLm9yZy8yMDAwL3N2ZyIgc3Ryb2tlPSIjZmZmZmZmMDAiPjxnPjwvZz4KPC9zdmc+\"  loading=\"lazy\" decoding=\"async\" width=\"662\" height=\"377\" data-original=\"https:\/\/blog.frost-s.com\/wp-content\/uploads\/2021\/02\/image-662x377.png\" src=\"data:image\/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAAAJcEhZcwAADsQAAA7EAZUrDhsAAAANSURBVBhXYzh8+PB\/AAffA0nNPuCLAAAAAElFTkSuQmCC\" alt=\"\" class=\"wp-image-482\"\/><\/div><\/figure>\n\n\n\n<p>LogicalPlan \u9636\u6bb5\u4f1a\u5c06\u7528\u6237\u6240\u8f93\u5165\u7684SQL Query  transition  to \u903b\u8f91\u7b97\u5b50\u6811\uff0csql \u8bed\u53e5\u4e2d\u6240\u5305\u542b\u7684\u903b\u8f91\u6620\u5c04\u5230\u903b\u8f91\u7b97\u5b50\u6811\u7684\u4e0d\u540c\u8282\u70b9\uff0c\u4e14\u903b\u8f91\u7b97\u5b50\u6811\u4e0d\u4f1a\u8fdb\u884c\u6267\u884c\uff0c\u53ea\u4f1a\u4f5c\u4e3a\u4e2d\u95f4\u9636\u6bb5\u3002<\/p>\n\n\n\n<p>\u903b\u8f91\u7b97\u5b50\u6811\u4e09\u5927\u5b50\u9636\u6bb5\uff1a<\/p>\n\n\n\n<p>\uff081\uff09Unresolved LogicalPlan\uff08\u672a\u89e3\u6790\u903b\u8f91\u7b97\u5b50\u6811\uff09\uff0c\u4ec5\u4ec5\u53ea\u662f\u6570\u636e\u7ed3\u6784\uff0c\u5176\u4e2d\u4e0d\u5305\u542b\u4efb\u4f55\u6570\u636e\u4fe1\u606f\u3002<\/p>\n\n\n\n<p>\uff082\uff09Analyzed LogicalPlan\uff08\u89e3\u6790\u540e\u7684\u903b\u8f91\u7b97\u5b50\u6811\uff09\uff0c\u8282\u70b9\u4e2d\u7ed1\u5b9a\u5404\u79cd\u4fe1\u606f\u3002<\/p>\n\n\n\n<p>\uff083\uff09Optimized LogcalPlan \uff08\u89e3\u6790\u4f18\u5316\u540e\u7684\u903b\u8f91\u7b97\u5b50\u6811\uff09\uff0c\u5e94\u7528\u5404\u79cd\u4f18\u5316\u89c4\u5219\u5bf9\u4e00\u4e9b\u4f4e\u6548\u7684\u903b\u8f91\u8ba1\u5212\u8fdb\u884c\u8f6c\u6362\u3002<\/p>\n\n\n\n<p>\u7269\u7406\u8ba1\u5212\u9636\u6bb5\uff0c\u4f1a\u5c06\u4e0a\u4e00\u6b65\u903b\u8f91\u8ba1\u7b97\u9636\u6bb5\u751f\u6210\u7684\u903b\u8f91\u7b97\u5b50\u6811\u6765\u8fdb\u884c\u5b9e\u9645\u8f6c\u6362\uff0c\u751f\u6210\u7269\u7406\u7b97\u5b50\u6811\uff0c\u5176\u4e2d\u7269\u7406\u8ba1\u5212\u9636\u6bb5\u4f1a\u76f4\u63a5\u751f\u6210RDD \u6216\u8005\u5bf9RDD\u6765\u8fdb\u884ctransformation \u64cd\u4f5c\uff0c\u6765\u5bf9\u5e94sql \uff0c\u5230\u6700\u540e\u624d\u8fdb\u884caction\u64cd\u4f5c\uff08\u5982\u5b9e\u4f8b\u4e2d\u7684show\u64cd\u4f5c\uff09\uff0c\u6765\u5bf9RDD\u6765\u8fdb\u884c\u5b9e\u9645\u63d0\u4ea4\u6267\u884c\u3002<\/p>\n\n\n\n<p>\u5176\u4e2dSQL \u8bed\u53e5\u5728\u89e3\u6790\u4e00\u76f4\u5230\u63d0\u4ea4\u4e4b\u524d\uff0c\u4e0a\u8ff0\u7684\u6574\u4e2a\u8f6c\u6362\u8fc7\u7a0b\u90fd\u662f\u5728spark\u96c6\u7fa4\u4e2ddriver\u7aef\u6765\u8fdb\u884c\u7684\uff0c\u4e0d\u6d89\u53ca\u5206\u5e03\u5f0f\u73af\u5883\uff0csparkSession \u7c7b\u7684sql \u65b9\u6cd5\u8c03\u7528sessionState\u4e2d\u7684\u5404\u79cd\u5bf9\u8c61\uff0c\u5305\u62ec\u4e0a\u8ff0\u7684\u4e0d\u540c\u9636\u6bb5\u751f\u6210\u7684\u5bf9\u5e94\u7684SparkSqlParser\u7c7b\u3001Analyzer\u7c7b\u3001Optimizer\u7c7b\u548csparkPlanner\u7c7b\u7b49\uff0c\u6700\u540e\u5c01\u88c5\u6210\u4e00\u4e2aQueryExcution\u5bf9\u8c61\u3002\u56e0\u6b64\uff0c\u5728\u8fdb\u884csparkSql\u5f00\u53d1\u7684\u8fc7\u7a0b\u4e2d\uff0c\u53ef\u4ee5\u5f88\u65b9\u4fbf\u7684\u5c06\u6bcf\u4e00\u6b65\u751f\u6210\u7684\u8ba1\u5212\u5355\u72ec\u7684\u5265\u79bb\u51fa\u6765\u8fdb\u884c\u5206\u6790\u3002<\/p>\n\n\n\n<p>\u91cd\u70b9\uff1a<\/p>\n\n\n\n<p>spark SQL \u5185\u90e8\u5b9e\u73b0\u4e0a\u8ff0\u6d41\u7a0b\u4e2d\u5e73\u53f0\u65e0\u5173\u90e8\u5206\u7684\u57fa\u7840\u6846\u67b6\u79f0\u4e4b\u4e3aCatalyst\u3002<\/p>\n\n\n\n<p>Catalyst\u4e2d\u4e3b\u8981\u5305\u62ec\u4e86InternalRow\u4f53\u7cfb\u3001TreeNode\u4f53\u7cfb\u3001\u548cExpression\u4f53\u7cfb\u3002<\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u672c\u6587\u7ae0\u7b80\u8981\u6982\u8ff0\u4e86spark sql \u7684\u6267\u884c\u6d41\u7a0b\u4ee5\u53ca\u57fa\u672c\u539f\u7406\u3002 spark sql \u672c\u8d28\u662f\u5c06sql \u8bed\u53e5\u89e3\u6790\u4e3a\u5bf9 [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":457,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11,19,9],"tags":[20,21],"_links":{"self":[{"href":"https:\/\/blog.frost-s.com\/index.php\/wp-json\/wp\/v2\/posts\/481"}],"collection":[{"href":"https:\/\/blog.frost-s.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.frost-s.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.frost-s.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.frost-s.com\/index.php\/wp-json\/wp\/v2\/comments?post=481"}],"version-history":[{"count":1,"href":"https:\/\/blog.frost-s.com\/index.php\/wp-json\/wp\/v2\/posts\/481\/revisions"}],"predecessor-version":[{"id":483,"href":"https:\/\/blog.frost-s.com\/index.php\/wp-json\/wp\/v2\/posts\/481\/revisions\/483"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blog.frost-s.com\/index.php\/wp-json\/wp\/v2\/media\/457"}],"wp:attachment":[{"href":"https:\/\/blog.frost-s.com\/index.php\/wp-json\/wp\/v2\/media?parent=481"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.frost-s.com\/index.php\/wp-json\/wp\/v2\/categories?post=481"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.frost-s.com\/index.php\/wp-json\/wp\/v2\/tags?post=481"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}