{"id":12281,"date":"2015-12-17T14:20:53","date_gmt":"2015-12-17T05:20:53","guid":{"rendered":"http:\/\/www.creationline.com\/?p=12281"},"modified":"2015-12-17T14:20:53","modified_gmt":"2015-12-17T05:20:53","slug":"kaggle%e3%81%ab%e6%8c%91%e6%88%a6%e3%81%97%e3%81%a6%e3%81%bf%e3%81%9f","status":"publish","type":"post","link":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281","title":{"rendered":"Kaggle\u306b\u6311\u6226\u3057\u3066\u307f\u305f"},"content":{"rendered":"<p><a href=\"\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-12352\" src=\"\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle.jpg\" alt=\"kaggle\" width=\"942\" height=\"261\" srcset=\"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle.jpg 942w, https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle-360x100.jpg 360w, https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle-768x213.jpg 768w, https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle-225x62.jpg 225w\" sizes=\"auto, (max-width: 942px) 100vw, 942px\" \/><\/a><\/p>\n<h2>\u306f\u3058\u3081\u306b<\/h2>\n<p>\u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3\u306e\u85e4\u7530\u3067\u3059\u3002<\/p>\n<p>\u4eca\u56de\u306fSpark\u3092\u4f7f\u3063\u3066\u30c7\u30fc\u30bf\u89e3\u6790\u3092\u884c\u3044\u307e\u3057\u305f\u3002\u30c7\u30fc\u30bf\u30b5\u30a4\u30a8\u30f3\u30c6\u30a3\u30b9\u30c8\u306e\u30b3\u30df\u30e5\u30cb\u30c6\u30a3\u3067\u3042\u308bKaggle\u3067\u884c\u308f\u308c\u3066\u3044\u308b\u7af6\u6280\u306b\u53c2\u52a0\u3057\u307e\u3057\u305f\u3002\u6311\u6226\u3057\u305f\u8ab2\u984c\u306f\u300eSpringleaf Marketing Response\u300f\u3067\u3059\u3002<\/p>\n<h2>Kaggle\u3068\u306f<\/h2>\n<p>\u4eca\u56de\u3068\u308a\u3042\u3052\u305fKaggle\u306b\u3064\u3044\u3066\u5c11\u3057\u8aac\u660e\u3044\u305f\u3057\u307e\u3059\u3002<\/p>\n<p>Kaggle\u3068\u306f\u4e16\u754c\u4e2d\u306e\u30c7\u30fc\u30bf\u30b5\u30a4\u30a8\u30f3\u30c6\u30a3\u30b9\u30c8\u9054\u304c\u8ab2\u984c\u89e3\u6c7a\u306e\u6700\u9069\u30e2\u30c7\u30eb\u3092\u7af6\u3044\u5408\u3046\u30b3\u30df\u30e5\u30cb\u30c6\u30a3\u30b5\u30a4\u30c8\u3067\u3059\u3002URL\u306f <a href=\"http:\/\/www.kaggle.com\/\">http:\/\/www.kaggle.com\/<\/a> \u3067\u3059\u3002<\/p>\n<p><a href=\"\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/53ce4ff188d1cb72b3ebe263265f39f8.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-12282\" src=\"\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/53ce4ff188d1cb72b3ebe263265f39f8.jpg\" alt=\"\u30b9\u30af\u30ea\u30fc\u30f3\u30b7\u30e7\u30c3\u30c8\" width=\"700\" height=\"525\" srcset=\"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/53ce4ff188d1cb72b3ebe263265f39f8.jpg 700w, https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/53ce4ff188d1cb72b3ebe263265f39f8-127x95.jpg 127w\" sizes=\"auto, (max-width: 700px) 100vw, 700px\" \/><\/a><\/p>\n<p>\u30b5\u30a4\u30c8\u306b\u306f\u69d8\u3005\u306a\u30b3\u30f3\u30da\u30c6\u30a3\u30b7\u30e7\u30f3\u304c\u7528\u610f\u3055\u308c\u3066\u304a\u308a\u3001\u65b0\u3057\u3044\u30b3\u30f3\u30da\u30c6\u30a3\u30b7\u30e7\u30f3\u3082\u968f\u6642\u8ffd\u52a0\u3055\u308c\u3066\u3044\u307e\u3059\u3002\u30b3\u30f3\u30da\u30c6\u30a3\u30b7\u30e7\u30f3\u306b\u306f\u8272\u3005\u306a\u7a2e\u985e\u304c\u3042\u308a\u3001\u4e2d\u306b\u306f\u8cde\u91d1\u304c\u4ed8\u3044\u3066\u3044\u308b\u3082\u306e\u3082\u3042\u308a\u307e\u3059\u3002\u5404\u30b3\u30f3\u30da\u30c6\u30a3\u30b7\u30e7\u30f3\u306e\u958b\u50ac\u671f\u9650\u306f\u6708\u5358\u4f4d\u304b\u3089\u9577\u3044\u3082\u306e\u3067\u306f\u5e74\u5358\u4f4d\u3067\u958b\u50ac\u3055\u308c\u3066\u3044\u308b\u3082\u306e\u304c\u3042\u308a\u307e\u3059\u3002\u30b3\u30f3\u30da\u30c6\u30a3\u30b7\u30e7\u30f3\u306e\u7d50\u679c\u306b\u3088\u3063\u3066\u30e9\u30f3\u30ad\u30f3\u30b0\u4ed8\u3051\u3082\u884c\u306a\u308f\u308c\u3001NOVICE\u3001KAGGLER\u3001MASTER\u306e\u9806\u306b\u3042\u304c\u3063\u3066\u3044\u304d\u307e\u3059\u3002<\/p>\n<p>\u30b3\u30f3\u30da\u30c6\u30a3\u30b7\u30e7\u30f3\u4ee5\u5916\u306b\u3082\u81ea\u5206\u306e\u30b9\u30af\u30ea\u30d7\u30c8\u3092\u516c\u958b\u3057\u305f\u308a\u3001\u4ed6\u306e\u30e6\u30fc\u30b6\u30fc\u306b\u8cea\u554f\u3057\u305f\u308a\u3001\u81ea\u5206\u306e\u30d6\u30ed\u30b0\u3092\u66f8\u304f\u3053\u3068\u304c\u3067\u304d\u308b\u6a5f\u80fd\u3082\u3042\u308a\u307e\u3059\u3002\u3055\u3089\u306b\u30c7\u30fc\u30bf\u89e3\u6790\u95a2\u4fc2\u306e\u6c42\u4eba\u30da\u30fc\u30b8\u3082\u3042\u308b\u306e\u3067\u3001Kaggle\u3067\u512a\u79c0\u306a\u30b9\u30b3\u30a2\u3092\u53d6\u3063\u3066\u3044\u308c\u3070\u30c7\u30fc\u30bf\u5206\u6790\u4f1a\u793e\u306b\u5c31\u8077\u3067\u304d\u308b\u304b\u3082\u3057\u308c\u307e\u305b\u3093\u3002<br \/>\n\u3082\u3057\u307f\u306a\u3055\u307e\u306e\u306a\u304b\u3067Kaggle\u306b\u8208\u5473\u3092\u6301\u305f\u308c\u305f\u65b9\u304c\u3044\u305f\u3089\u3001\u6311\u6226\u3057\u3066\u307f\u308b\u3053\u3068\u3092\u304a\u52e7\u3081\u3044\u305f\u3057\u307e\u3059\u3002Kaggle\u306e\u7af6\u4e89\u306b\u306f\u8ab0\u3067\u3082\u53c2\u52a0\u3059\u308b\u3053\u3068\u304c\u3067\u304d\u307e\u3059(\u6ce8\u610f: \u4e00\u90e8\u306e\u30b3\u30f3\u30da\u30c6\u30a3\u30b7\u30e7\u30f3\u306f\u4f55\u56de\u304bKaggle\u306e\u4e00\u822c\u30b3\u30f3\u30da\u30c6\u30a3\u30b7\u30e7\u30f3\u306b\u53c2\u52a0\u3057\u3066\u3044\u306a\u3044\u3068\u53c2\u52a0\u3067\u304d\u306a\u3044\u3082\u306e\u3082\u3042\u308a\u307e\u3059)\u3002<\/p>\n<h2>Springleaf Marketing Response<\/h2>\n<p>\u591a\u6570\u3042\u308b\u30b3\u30f3\u30da\u30c6\u30a3\u30b7\u30e7\u30f3\u306e\u4e2d\u304b\u3089\u4eca\u56de\u306f\u300eSpringleaf Marketing Response\u300f\u306b\u6311\u6226\u3057\u307e\u3057\u305f\u3002\u305d\u306e\u5185\u5bb9\u306f\u91d1\u878d\u4f1a\u793e\u3067\u3042\u308bSpringleaf\u793e\u304c\u6570\u591a\u304f\u306e\u500b\u4eba\u5b9b\u306b\u9001\u308b\u878d\u8cc7\u3084\u30ed\u30fc\u30f3\u306b\u3064\u3044\u3066\u306e\u30c0\u30a4\u30ec\u30af\u30c8\u30e1\u30fc\u30eb\u306e\u4e2d\u3067\u3001\u5b9f\u969b\u306b\u53cd\u5fdc\u3059\u308b\u4eba\u3092\u898b\u6975\u3081\u308b\u3053\u3068\u304c\u3067\u304d\u308b\u304b\u3092\u3001\u7d71\u8a08\u7684\u306b\u4e88\u6e2c\u3057\u3088\u3046\u3068\u3044\u3046\u3082\u306e\u3067\u3059\u3002<\/p>\n<p>\u30c0\u30a4\u30ec\u30af\u30c8\u30e1\u30fc\u30eb\u5168\u4f53\u306e\u4e2d\u3067\u53cd\u5fdc\u3059\u308b\u4eba\u304c\u591a\u304f\u306a\u308b\u307b\u3069\u3001Springleaf\u793e\u306f\u30c0\u30a4\u30ec\u30af\u30c8\u30e1\u30fc\u30eb\u3092\u52b9\u7387\u7684\u306b\u6d3b\u7528\u3059\u308b\u3053\u3068\u304c\u3067\u304d\u3001\u30b3\u30b9\u30c8\u3092\u524a\u6e1b\u3059\u308b\u3053\u3068\u304c\u53ef\u80fd\u3067\u3059\u3002\u3053\u306e\u30b3\u30f3\u30da\u30c6\u30a3\u30b7\u30e7\u30f3\u3067\u306f\u3001\u533f\u540d\u5316\u304c\u65bd\u3055\u308c\u305f\u9867\u5ba2\u30c7\u30fc\u30bf\u3092\u89e3\u6790\u3057\u305d\u308c\u305e\u308c\u306e\u9867\u5ba2\u304c\u30c0\u30a4\u30ec\u30af\u30c8\u30e1\u30fc\u30eb\u306b\u3069\u306e\u3088\u3046\u306a\u53cd\u5fdc\u3092\u793a\u3059\u304b\u3092\u4e88\u6e2c\u3057\u307e\u3059\u3002<\/p>\n<h2>\u30c7\u30fc\u30bf\u306e\u69cb\u9020\u3068\u89e3\u6790\u6226\u7565<\/h2>\n<p>\u3055\u3066\u3001\u300eSpringleaf Marketing Response\u300f\u3067\u7528\u610f\u3055\u308c\u305f\u30c7\u30fc\u30bf\u306f\u3069\u306e\u3088\u3046\u306a\u3082\u306e\u306b\u306a\u3063\u3066\u3044\u308b\u3067\u3057\u3087\u3046\u304b\u3002\u30c7\u30fc\u30bf\u306e\u69cb\u9020\u3092\u4ee5\u4e0b\u306e\u56f3\u306b\u3057\u3066\u307f\u307e\u3057\u305f\u3002<\/p>\n<table style=\"height: 158px;\" width=\"691\">\n<tbody>\n<tr>\n<td>ID<\/td>\n<td>VAR_0001<\/td>\n<td>VAR_0002<\/td>\n<td>\u2026<\/td>\n<td>VAR_1934<\/td>\n<td>target<\/td>\n<\/tr>\n<tr>\n<td>1<\/td>\n<td>\u201cR\u201d<\/td>\n<td>360<\/td>\n<td>\u2026<\/td>\n<td>\u201cIAPS\u201d<\/td>\n<td><\/td>\n<\/tr>\n<tr>\n<td>...<\/td>\n<td>...<\/td>\n<td>...<\/td>\n<td>...<\/td>\n<td>...<\/td>\n<td>...<\/td>\n<\/tr>\n<tr>\n<td>290463<\/td>\n<td>\u201cH\u201d<\/td>\n<td>228<\/td>\n<td>\u2026<\/td>\n<td>\u201cIAPS\u201d<\/td>\n<td>0<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>\u30c7\u30fc\u30bf\u5f62\u5f0f\u3084\u3001\u3069\u306e\u3088\u3046\u306a\u7a2e\u985e\u306e\u30c7\u30fc\u30bf\u304b\u308f\u304b\u3089\u306a\u3044\u5217\u304c1934\u500b\u3082\u3042\u308a\u307e\u3059\u3002\u3064\u307e\u308a\u3001\u5217\u306e\u6027\u683c\u3092\u610f\u8b58\u3057\u3066\u5206\u985e\u3059\u308b\u3053\u3068\u306f\u4e0d\u53ef\u80fd\u3067\u3042\u308b\u3001\u3068\u3044\u3046\u72b6\u614b\u306b\u306a\u3063\u3066\u3044\u307e\u3059\u3002\u307e\u305f\u30c7\u30fc\u30bf\u306e\u884c\u6570\u308230\u4e07\u884c\u3042\u308a\u3001\u5230\u5e95Excel\u3067\u6271\u3048\u308b\u3088\u3046\u306a\u30c7\u30fc\u30bf\u3067\u306f\u3042\u308a\u307e\u305b\u3093\u3002<\/p>\n<p>\u3053\u306e\u3088\u3046\u306a\u30c7\u30fc\u30bf\u304c900MB\u7a0b\u5ea6\u306e2\u3064\u306e\u30d5\u30a1\u30a4\u30ebtrain.csv\u3068 test.csv\u306b\u5206\u3051\u3089\u308c\u3066\u914d\u5e03\u3055\u308c\u307e\u3059(train.csv\u306f\u201dtarget\u201d\u30ab\u30e9\u30e0\u306b\u5024\u304c\u5165\u3063\u3066\u304a\u308a\u3001test.csv\u306b\u306f\u5024\u304c\u5165\u3063\u3066\u3044\u307e\u305b\u3093)\u3002<\/p>\n<p>\u3053\u306e\u30c7\u30fc\u30bf\u304b\u3089\u30e6\u30fc\u30b6\u30fc\u306e\u53cd\u5fdc\u3092\u4e88\u60f3\u3059\u308b\u306b\u306f\u3069\u3046\u3059\u308c\u3070\u3088\u3044\u3067\u3057\u3087\u3046\u304b\u3002\u79c1\u306f\u7d71\u8a08\u5b66\u3084\u30c7\u30fc\u30bf\u89e3\u6790\u306b\u306f\u8a73\u3057\u304f\u306a\u3044\u306e\u3067\u3059\u304c\u3001\u4ee5\u4e0b\u306e\u3088\u3046\u306a\u6226\u7565\u3092\u7acb\u3066\u307e\u3057\u305f\u3002<\/p>\n<p>\u76ee\u7684 : test.csv\u306e\u5404\u30e6\u30fc\u30b6\u30fc\u306etarget\u30ab\u30e9\u30e0\u5024\u3092\u4e88\u60f3\u3059\u308b\u3002<\/p>\n<p>\u4eee\u8aac : target\u30ab\u30e9\u30e0\u5024\u306f\u30e6\u30fc\u30b6\u30fc\u306e\u4ed6\u306e\u7279\u5b9a\u306e\u30ab\u30e9\u30e0\u5024\u3068\u4f9d\u5b58\u95a2\u4fc2\u304c\u3042\u308b\u3002<\/p>\n<p style=\"padding-left: 30px;\">\u4eee\u8aac\u3088\u308atest.csv\u306e\u5404\u30e6\u30fc\u30b6\u30fc\u304ctrain.csv\u306e\u53cd\u5fdc\u306e\u6709\u3063\u305f\u30e6\u30fc\u30b6\u30fc\u7fa4\u3068\u3069\u306e\u304f\u3089\u3044\u4f3c\u3066\u3044\u308b\u304b\u3092\u8a08\u7b97\u3059\u308c\u3070\u3088\u3044\u306e\u3067\u306f\u306a\u3044\u3060\u308d\u3046\u304b\u3068\u8003\u3048\u305f\u3002<\/p>\n<p style=\"padding-left: 30px;\">\u5177\u4f53\u7684\u306a\u624b\u9806\u3068\u3057\u3066\u306f<\/p>\n<ol>\n<li>\u30c8\u30ec\u30fc\u30cb\u30f3\u30b0\u30c7\u30fc\u30bf\u3092\u53cd\u5fdc\u306e\u3042\u3063\u305f\u30b0\u30eb\u30fc\u30d7\u3068\u7121\u304b\u3063\u305f\u30b0\u30eb\u30fc\u30d7\u306b\u5206\u3051\u308b\u3002target\u306e\u5024\u304c0\u304b1\u304b\u3067\u5224\u65ad\u3059\u308b\u3002<\/li>\n<li>\u985e\u4f3c\u5ea6\u306e\u57fa\u6e96\u3068\u306a\u308b\u30c7\u30fc\u30bf\u3092\u4f5c\u308b\u3002\u53cd\u5fdc\u306e\u6709\u3063\u305f\u30b0\u30eb\u30fc\u30d7\u306e\u5e73\u5747\u5024\u3092\u51fa\u3057\u57fa\u6e96\u3068\u3059\u308b\u3002\u5404\u30d1\u30e9\u30e1\u30fc\u30bf\u306e\u6a19\u6e96\u5316\u306e\u305f\u3081\u306b\u5206\u6563\u30fb\u6a19\u6e96\u504f\u5dee\u3082\u6c42\u3081\u307e\u3059\u3002<\/li>\n<li>\u30c6\u30b9\u30c8\u30c7\u30fc\u30bf\u306e\u30e6\u30fc\u30b6\u30c7\u30fc\u30bf\u304c\u3069\u308c\u304f\u3089\u3044\u53cd\u5fdc\u306e\u3042\u3063\u305f\u30b0\u30eb\u30fc\u30d7\u306b\u4f3c\u3066\u3044\u308b\u304b\u3092\u8abf\u3079\u308b\u3002<\/li>\n<\/ol>\n<h2>\u89e3\u6790\u74b0\u5883<\/h2>\n<p>\u4eca\u56de\u306fDigital Ocean\u306e\u30af\u30e9\u30a6\u30c9\u74b0\u5883\u3092\u4f7f\u3063\u3066\u5206\u6790\u3092\u884c\u3044\u307e\u3057\u305f\u3002<\/p>\n<p>\u74b0\u5883\u306f\u4ee5\u4e0b\u306e\u901a\u308a\u3067\u3059\u3002<\/p>\n<ul style=\"list-style-type: disc;\">\n<li>CPU: 2\u30b3\u30a2\u3001 2\u30b9\u30ec\u30c3\u30c9<\/li>\n<li>\u30e1\u30e2\u30ea: 4GB<\/li>\n<li>SSD: 60GB<\/li>\n<li>OS: CentOS7<\/li>\n<li>Spark 1.5.0<\/li>\n<li>Hadoop 2.6<\/li>\n<li>SBT 0.13.9<\/li>\n<\/ul>\n<h2>\u30c7\u30fc\u30bf\u89e3\u6790\u3092\u59cb\u3081\u308b<\/h2>\n<p>\u305d\u308c\u3067\u306f\u30c7\u30fc\u30bf\u89e3\u6790\u3092\u59cb\u3081\u307e\u3057\u3087\u3046\u3002\u4eca\u56de\u89e3\u6790\u30d7\u30ed\u30b0\u30e9\u30e0\u306fSpark+Scala\u3067\u4f5c\u6210\u3057\u307e\u3057\u305f\u3002<\/p>\n<p>1. \u30c7\u30fc\u30bf(train.csv)\u306e\u5206\u5272<\/p>\n<p>\u30c8\u30ec\u30fc\u30cb\u30f3\u30b0\u30c7\u30fc\u30bf\u306e\u5206\u5272\u3092\u884c\u3044\u307e\u3059\u3002<\/p>\n<p>\u300c\u30c7\u30fc\u30bf\u69cb\u9020\u3068\u89e3\u6790\u6226\u7565\u300d\u3067\u56f3\u793a\u3057\u305f\u30c7\u30fc\u30bf\u69cb\u9020\u306e\u6700\u5f8c\u306e\u30ab\u30e9\u30e0\"target\u201d\u306e\u5024\u304c\u30e6\u30fc\u30b6\u30fc\u306e\u53cd\u5fdc\u3092\u8868\u308f\u3057\u3066\u3044\u307e\u3059\u3002target\u306e\u5024\u304c1\u3067\u3042\u308c\u3070\u3001\u305d\u306e\u30e6\u30fc\u30b6\u30fc\u306fDM\u306b\u53cd\u5fdc\u3092\u3057\u3066\u3044\u307e\u3059\u3002\u5024\u304c0\u306a\u3089\u3070\u53cd\u5fdc\u3092\u3057\u3066\u3044\u307e\u305b\u3093\u3002<\/p>\n<p>\u305d\u3053\u3067target\u306e\u5024\u3092\u4f7f\u3044\u3001\u30e6\u30fc\u30b6\u30fc\u3092\u53cd\u5fdc\u306e\u6709\u7121\u3067\u5206\u5272\u3057\u307e\u3059\u3002<\/p>\n<pre>\/\/\u30c8\u30ec\u30fc\u30cb\u30f3\u30b0\u30c7\u30fc\u30bf\u306e\u8aad\u307f\u8fbc\u307f\u3068\u5206\u985e\r\nval dataset = sc.textFile(\u201ctrain.csv\u201d).map(line =&gt; line.split(\u201c,\u201d))\r\n\r\nval train0 = dataset.filter(line =&gt; line.last==\u201d0\u201d) \u00a0\u00a0\u00a0\u00a0\/\/\u53cd\u5fdc\u7121\u3057\r\nval train1 = dataset.filter(line =&gt; line.last==\u201d1\u201d) \u00a0\u00a0\u00a0\u00a0\/\/\u53cd\u5fdc\u6709\u308a\r\n<\/pre>\n<p>2. \u5e73\u5747\u30fb\u5206\u6563\u306a\u3069\u306e\u8a08\u7b97<\/p>\n<p>train0\u3068train1\u306e\u5e73\u5747\u30fb\u5206\u6563\u30fb\u6a19\u6e96\u504f\u5dee\u3092\u8a08\u7b97\u3057\u3001\u5e73\u5747\u306e\u5dee\u306e\u5272\u5408\u3082\u6c42\u3081\u307e\u3059(data0\u306f\u53cd\u5fdc\u306e\u7121\u304b\u3063\u305f\u30b0\u30eb\u30fc\u30d7\u3001data1\u306f\u53cd\u5fdc\u306e\u6709\u3063\u305f\u30b0\u30eb\u30fc\u30d7)\u3002<\/p>\n<p>\u305d\u308c\u3092\u5404\u30e6\u30fc\u30b6ID\u3067\u307e\u3068\u3081\u305fRDD\u3092\u4f5c\u6210\u3057\u307e\u3059\u3002<\/p>\n<pre>var rlist: List[String] = List()\r\nval leng = dataset.first.length\r\n\r\nfor ( i &lt;- 1 to leng -1 ) {\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0var data0 = train0.map(line =&gt; line(i)).filter(x =&gt; !(x contains \u201c\\\u201d\u201d)).filter(x =&gt; x!=\u201dNA\u201d).map(x =&gt; x.toDouble).cache\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0var data1 = train1.map(line =&gt; line(i)).filter(x =&gt; !(x contains \u201c\\\u201d\u201d)).filter(x =&gt; x!=\u201dNA\u201d).map(x =&gt; x.toDouble).cache\r\n\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0var diff = (data0.mean - data1.mean)\/data1.mean.abs\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0rlist = i +\u201d,\u201d+ diff +\u201d,\u201d+ data0.mean +\u201d,\u201d+ data1.mean +\u201d,\u201d+ data1.variance +\u201d,\u201d+ data1.stdev :: rlist\r\n}\r\n\r\nrlist = rlist.reverse\r\nval rlistrdd = sc.parallelize(rlist).map(line =&gt; line.split(\u201c,\u201d))\r\n<\/pre>\n<p>\u3053\u308c\u3067{\u30e6\u30fc\u30b6ID, \u5e73\u5747\u5dee\u306e\u5272\u5408, data0\u306e\u5e73\u5747, data1\u306e\u5e73\u5747, data1\u306e\u5206\u6563, data1\u306e\u6a19\u6e96\u504f\u5dee}\u306e\u8981\u7d20\u3092\u6301\u3063\u305fRDD\u304c\u4f5c\u6210\u3055\u308c\u307e\u3057\u305f\u3002<\/p>\n<p>3. \u30c6\u30b9\u30c8\u30c7\u30fc\u30bf\u306e\u5404\u30e6\u30fc\u30b6\u306e\u985e\u4f3c\u5ea6\u3092\u8abf\u3079\u308b\u3002<\/p>\n<p>\u30c8\u30ec\u30fc\u30cb\u30f3\u30b0\u30c7\u30fc\u30bf\u3068\u30c6\u30b9\u30c8\u30c7\u30fc\u30bf\u304c\u540c\u3058\u6bcd\u96c6\u56e3\u3092\u6301\u3064\u3068\u3057\u3066\u3001\u30c6\u30b9\u30c8\u30c7\u30fc\u30bf\u306e\u30e6\u30fc\u30b6\u306e\u8981\u7d20\u3092\u524d\u8ff0\u3067\u6c42\u3081\u305ftrain1\u306e\u5e73\u5747\u3068\u6a19\u6e96\u504f\u5dee\u3067\u6a19\u6e96\u5316\u3092\u884c\u3046\u3002\u6a19\u6e96\u5316\u3055\u308c\u305f\u30c7\u30fc\u30bf\u306f\u5e73\u5747\u304c0\u3067\u6a19\u6e96\u504f\u5dee\u304c1\u306a\u306e\u3067\u3001-3\uff5e3\u306e\u9593\u306b99.74%\u306e\u30c7\u30fc\u30bf\u304c\u5b58\u5728\u3059\u308b\u3053\u3068\u306b\u306a\u308a\u307e\u3059\u3002<\/p>\n<p>\u3064\u307e\u308a\u985e\u4f3c\u5ea6\u95a2\u6570\u306f\u5b9a\u7fa9\u57df\u304c-3\u2266x\u22663\u3067\u3001x=\u00b13\u3067f(x)\u22520\u3001x=0\u3067f(x)=1\u3068\u306a\u308b\u3088\u3046\u306b\u5b9a\u7fa9\u3059\u308c\u3070\u3088\u3044\u3002<\/p>\n<p>\u3053\u3046\u3059\u308b\u3053\u3068\u306b\u3088\u3063\u30661\u3064\u306e\u8981\u7d20\u304c\u3069\u308c\u3060\u3051\u4f3c\u3066\u3044\u308b\u304b\u304c\u30d1\u30fc\u30bb\u30f3\u30c6\u30fc\u30b8\u3067\u308f\u304b\u308a\u3001\u305d\u308c\u3092\u8981\u7d20\u9593\u3067\u639b\u3051\u3042\u308f\u305b\u308c\u3070\u5168\u4f53\u3068\u3057\u3066\u3069\u308c\u3060\u3051\u985e\u4f3c\u3057\u3066\u3044\u308b\u304b\u304c\u5206\u304b\u308b\u306f\u305a\u3067\u3059\u3002<\/p>\n<pre>\/\/\u985e\u4f3c\u5ea6\u8a08\u7b97\u7528\u95a2\u6570\u306e\u5b9a\u7fa9\r\ndef similarityFunc(line: Array[String], diffsort: Array[Array[String]]): (String, Double) = {\r\n        var sum = 1.0\r\n        var LENG = diffsort.length\r\n        for ( i  1\/line(1).toDouble.abs).filter(line =&gt; line(1)!=\u201dInfinity\u201d).take(100)\r\n                     \/\/ \u4e0a\u8a18\u306e\u3088\u3046\u306a\u65b9\u6cd5\u3067\u95a2\u4fc2\u304c\u6df1\u305d\u3046\u306a\u8981\u7d20\u3092\u3044\u308d\u3044\u308d\u8a66\u3059\u3002\r\n\r\n\/\/\u30c6\u30b9\u30c8\u30c7\u30fc\u30bf\u306e\u8aad\u307f\u8fbc\u307f\r\nvar source = Source.fromFile(\u201ctest.csv\u201d)      \/\/ scala.io.Source\u306e\u30a4\u30f3\u30dd\u30fc\u30c8\u304c\u5fc5\u8981\r\n\r\n\/\/var simil: List[(String, Double)] = List()\r\nval savefile = new PrintWriter(\u201cresult.csv\u201d)\r\n\r\nfor ( line &lt;- source.getLines) {\r\n        var linesplit = line split \u2018,\u2019\r\n        var data = similarityFunc(linesplit, diffsort)\r\n\/\/        simil = data :: simil\r\n        file.write(data._1 +\u201d,\u201d+ data._2 +\u201d\\n\u201d)\r\n}\r\n\r\nsource.close\r\nfile.close<\/pre>\n<p>diffsort\u306f\u3069\u306e\u8981\u7d20\u306e\u985e\u4f3c\u5ea6\u3092\u8a08\u7b97\u3059\u308b\u304b\u3092\u6c7a\u5b9a\u3057\u3066\u3044\u307e\u3059\u3002\u3053\u306e\u65b9\u6cd5\u306f\u3069\u306e\u8981\u7d20\u3092\u4f7f\u3046\u304b\u304c\u975e\u5e38\u306b\u91cd\u8981\u306b\u306a\u3063\u3066\u304d\u307e\u3059\u3002\u4e0a\u8a18\u306e\u30d7\u30ed\u30b0\u30e9\u30e0\u3067\u306fdiffsort\u3067\u5e73\u5747\u5dee\u304c\u5927\u304d\u3044\u3082\u306e\u3092\u9078\u629e\u3057\u3066\u985e\u4f3c\u5ea6\u3092\u8a08\u7b97\u3055\u305b\u307e\u3057\u305f\u3002<\/p>\n<h2>\u8a08\u7b97\u7d50\u679c<\/h2>\n<p>result.csv\u306e\u4e2d\u8eab\u306f\u4ee5\u4e0b\u306e\u3088\u3046\u306a\u5f62\u5f0f\u306b\u306a\u308a\u307e\u3059\u3002\u7b2c1\u30ab\u30e9\u30e0\u304c\u30e6\u30fc\u30b6\u30fcID\u3067\u3001\u7b2c2\u30ab\u30e9\u30e0\u304c\u4e88\u6e2c\u5024\u306b\u306a\u308a\u307e\u3059\u3002<\/p>\n<pre>1,0.218461508716\r\n3,0.174768860363\r\n6,0.179894746315\r\n9,0.272701919919\r\n10,0.0399027661514\r\n11,0.234088594181\r\n12,0.226296085687\r\n13,0.273507234459\r\n15,0.306056192286\r\n       \uff5e(\u4ee5\u4e0b\u7565)\uff5e\r\n<\/pre>\n<h2>\u8a08\u7b97\u7d50\u679c\u3092\u6295\u7a3f\u3057\u3066\u307f\u308b<\/h2>\n<p>\u8a08\u7b97\u7d50\u679c\u304c\u51fa\u6765\u305f\u306e\u3067\u3001\u7d50\u679c\u3092Kaggle\u306b\u6295\u7a3f\u3057\u3066\u307f\u307e\u3057\u3087\u3046\u3002\u6295\u7a3f\u306fKaggle\u306e\u6311\u6226\u3057\u3066\u3044\u308b\u30b3\u30f3\u30da\u30c6\u30a3\u30b7\u30e7\u30f3\u306e\u30da\u30fc\u30b8\u3067\u884c\u306a\u3048\u307e\u3059\u3002\u5de6\u306b\u3042\u308b\u30e1\u30cb\u30e5\u30fc\u304b\u3089\"Make a submission\u201d\u306b\u79fb\u52d5\u3057\u3001\u63d0\u51fa\u3057\u305f\u3044\u30d5\u30a1\u30a4\u30eb\u3092\u9078\u629e\u3057\u307e\u3057\u3087\u3046\u3002<\/p>\n<p>\u6295\u7a3f\u3059\u308b\u3068\u3059\u3050\u7b54\u3048\u5408\u308f\u305b\u304c\u81ea\u52d5\u3067\u958b\u59cb\u3055\u308c\u307e\u3059\u3002\u5c11\u3057(30\u79d2\u304f\u3089\u3044)\u5f85\u3064\u3068\u7b54\u3048\u5408\u308f\u305b\u304c\u7d42\u4e86\u3057\u3001\u7d50\u679c\u304c\u8868\u793a\u3055\u308c\u307e\u3059\u3002<\/p>\n<p>\u4eca\u56de\u6311\u6226\u3057\u305f\u30b3\u30f3\u30da\u30c6\u30a3\u30b7\u30e7\u30f3\u3067\u306f\u3001\u7d50\u679c\u306f1\u4ee5\u4e0b\u3067\u5c0f\u6570\u70b9\u7b2c5\u4f4d\u307e\u3067\u306e\u6570\u3067\u8868\u793a\u3055\u308c\u30011\u306b\u8fd1\u3044\u307b\u3069\u4e88\u6e2c\u306e\u7cbe\u5ea6\u304c\u826f\u304f\u3001\u5f97\u70b9\u304c\u9ad8\u3044\u3067\u3059\u3002<\/p>\n<h2>\u4eca\u56de\u306e\u7d50\u679c<\/h2>\n<p>\u79c1\u306e\u6700\u7d42\u7684\u306a\u5f97\u70b9\u306f0.55012\u3067\u3057\u305f\u3002\u9806\u4f4d\u3067\u3044\u3048\u30702226\u30c1\u30fc\u30e0\u4e2d\u306e2076\u4f4d\u3067\u3057\u305f\u30021\u4f4d\u306e\u30c1\u30fc\u30e0\u306e\u5f97\u70b9\u306f0.80427\u3067\u3057\u305f\u3002<\/p>\n<p>\u53c2\u52a0\u8005\u306f\u81ea\u5206\u304c\u4f5c\u6210\u3057\u305fscript\u3092\u516c\u958b\u3057\u3066\u3001\u4ed6\u306e\u53c2\u52a0\u8005\u3068\u610f\u898b\u3092\u4ea4\u63db\u3057\u305f\u308a\u3001\u30a2\u30c9\u30d0\u30a4\u30b9\u3092\u8cb0\u3063\u305f\u308a\u3059\u308b\u3053\u3068\u304c\u3067\u304d\u3001\u4ed6\u306e\u4eba\u304c\u516c\u958b\u3057\u3066\u3044\u308b\u60c5\u5831\u3092\u898b\u3066\u81ea\u5206\u306e\u30c7\u30fc\u30bf\u89e3\u6790\u306b\u5f79\u7acb\u3066\u308b\u3053\u3068\u304c\u53ef\u80fd\u3067\u3059\u3002\u4eca\u56de\u79c1\u306f\u5e73\u5747\u3092\u6c42\u3081\u3066\u95a2\u4fc2\u306e\u3042\u308a\u305d\u3046\u306a\u30d1\u30e9\u30e1\u30fc\u30bf\u3092\u8abf\u3079\u3066\u985e\u4f3c\u5ea6\u3068\u3057\u3066\u4e88\u6e2c\u3092\u5c0e\u304d\u307e\u3057\u305f\u304c\u3001\u4ed6\u306e\u53c2\u52a0\u8005\u306b\u306f\u30af\u30e9\u30b9\u30bf\u30ea\u30f3\u30b0\u3092\u884c\u306a\u3063\u3066\u30d1\u30e9\u30e1\u30fc\u30bf\u9593\u306e\u95a2\u4fc2\u3092\u8abf\u3079\u305f\u308a\u3001\u533f\u540d\u5316\u3055\u308c\u305f\u30c7\u30fc\u30bf\u306e\u5143\u306e\u610f\u5473\u3092\u63a2\u3063\u3066\u3044\u308b\u4eba\u3082\u3044\u307e\u3057\u305f\u3002<\/p>\n<h2>\u307e\u3068\u3081<\/h2>\n<p>Kaggle\u306b\u6311\u6226\u3059\u308b\u3053\u3068\u3092\u6c7a\u3081\u305f\u5f53\u521d\u306f\u5165\u8cde\u3057\u3066\u8cde\u91d1\u3092\u9802\u3053\u3046\u3068\u5927\u304d\u304f\u606f\u5dfb\u3044\u3066\u3044\u307e\u3057\u305f\u304c\u3001\u5b9f\u969b\u306b\u6311\u6226\u3057\u3066\u307f\u308b\u3068\u30c7\u30fc\u30bf\u89e3\u6790\u306b\u306f\u69d8\u3005\u306a\u77e5\u8b58\u304c\u5fc5\u8981\u3068\u3055\u308c\u3001\u3042\u3048\u306a\u304f\u7121\u60e8\u306a\u7d50\u679c\u306b\u7d42\u308f\u3063\u3066\u3057\u307e\u3044\u307e\u3057\u305f\u3002\u3057\u304b\u3057\u3001\u89e3\u6790\u3092\u9032\u3081\u308b\u306a\u304b\u3067\u8272\u3005\u3068\u8abf\u3079\u305f\u3053\u3068\u3084\u3001\u4ed6\u306e\u53c2\u52a0\u8005\u306escript\u3092\u898b\u305f\u308a\u3068\u591a\u304f\u306e\u5f97\u308b\u3082\u306e\u304c\u3042\u308a\u307e\u3057\u305f\u3002\u7279\u306b\u4e0a\u4f4d\u9663\u304c\u516c\u958b\u3057\u3066\u3044\u308b\u60c5\u5831\u306f\u521d\u5fc3\u8005\u3067\u3042\u308b\u79c1\u306b\u306f\u96e3\u3057\u304b\u3063\u305f\u3067\u3059\u304c\u3001\u975e\u5e38\u306b\u70ba\u306b\u306a\u308b\u3082\u306e\u304c\u591a\u304b\u3063\u305f\u3067\u3059\u3002<\/p>\n<p>\u4ed6\u306e\u53c2\u52a0\u8005\u306e\u591a\u304f\u306fR\u3084Python\u3092\u4f7f\u7528\u3057\u3066\u3044\u307e\u3057\u305f\u3002<\/p>\n<p>Spark\u3067\u30c7\u30fc\u30bf\u89e3\u6790\u3092\u884c\u306a\u3063\u305f\u611f\u60f3\u3068\u3057\u307e\u3057\u3066\u306f\u3001RDD\u306e\u64cd\u4f5c\u306b\u624b\u3053\u305a\u308a\u307e\u3057\u305f\u3002\u5404RDD\u306e\u884c\u5358\u4f4d\u3067\u306e\u51e6\u7406\u304c\u4e0a\u624b\u304f\u6271\u3048\u305a\u306b\u82e6\u52b4\u3057\u3066\u3001\u7d50\u5c40Scala\u306e\u30ea\u30b9\u30c8\u3084\u30d5\u30a1\u30a4\u30eb\u64cd\u4f5c\u3092\u4f7f\u3063\u3066\u3057\u307e\u3044\u307e\u3057\u305f\u3002\u306a\u3093\u3067\u3082\u304b\u3093\u3067\u3082Spark\u3067\u51e6\u7406\u3059\u308b\u306e\u3067\u306f\u306a\u304f\u3001\u5206\u6563\u51e6\u7406\u306a\u3069\u306eSpark\u304c\u5f97\u610f\u3068\u3059\u308b\u5206\u91ce\u3067\u4f7f\u3063\u305f\u308a\u3068\u3001\u7d44\u307f\u5408\u308f\u305b\u304c\u5927\u5207\u3060\u3068\u611f\u3058\u307e\u3057\u305f\u3002<\/p>\n<p>Spark\u306b\u3064\u3044\u3066\u306e\u826f\u3044\u52c9\u5f37\u306b\u3082\u306a\u3063\u305f\u3068\u601d\u3044\u307e\u3059\u3002\u3053\u308c\u304b\u3089\u3082Spark\u3092\u4f7f\u3063\u3066\u69d8\u3005\u306a\u5206\u6790\u306b\u6311\u6226\u3057\u3066\u3044\u304d\u305f\u3044\u3068\u601d\u3044\u307e\u3059\uff01<\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u306f\u3058\u3081\u306b \u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3\u306e\u85e4\u7530\u3067\u3059\u3002 \u4eca\u56de\u306fSpark\u3092\u4f7f\u3063\u3066\u30c7\u30fc\u30bf\u89e3\u6790\u3092\u884c\u3044\u307e\u3057\u305f\u3002\u30c7\u30fc\u30bf\u30b5\u30a4\u30a8\u30f3\u30c6\u30a3\u30b9\u30c8\u306e\u30b3\u30df\u30e5\u30cb\u30c6\u30a3\u3067\u3042\u308bKaggle\u3067\u884c\u308f\u308c\u3066\u3044\u308b\u7af6\u6280\u306b\u53c2\u52a0\u3057\u307e\u3057\u305f\u3002\u6311\u6226\u3057\u305f\u8ab2\u984c\u306f\u300eSpringleaf M [&#8230;]<\/p>\n","protected":false},"author":1,"featured_media":12352,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"content-type":"","footnotes":""},"categories":[51,16],"tags":[50],"class_list":["post-12281","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-spark","category-author","tag-spark"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Kaggle\u306b\u6311\u6226\u3057\u3066\u307f\u305f - Tech Blog\uff5c\u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3<\/title>\n<meta name=\"description\" content=\"Spark, \u8457\u8005\uff08Author\uff09 |\u306f\u3058\u3081\u306b \u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3\u306e\u85e4\u7530\u3067\u3059\u3002\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281\" \/>\n<meta property=\"og:locale\" content=\"ja_JP\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Kaggle\u306b\u6311\u6226\u3057\u3066\u307f\u305f - Tech Blog\uff5c\u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3\" \/>\n<meta property=\"og:description\" content=\"Spark, \u8457\u8005\uff08Author\uff09 |\u306f\u3058\u3081\u306b \u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3\u306e\u85e4\u7530\u3067\u3059\u3002\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281\" \/>\n<meta property=\"og:site_name\" content=\"Tech Blog\uff5c\u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/creationline\" \/>\n<meta property=\"article:published_time\" content=\"2015-12-17T05:20:53+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"942\" \/>\n\t<meta property=\"og:image:height\" content=\"261\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"admin\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@creationline\" \/>\n<meta name=\"twitter:site\" content=\"@creationline\" \/>\n<meta name=\"twitter:label1\" content=\"\u57f7\u7b46\u8005\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin\" \/>\n\t<meta name=\"twitter:label2\" content=\"\u63a8\u5b9a\u8aad\u307f\u53d6\u308a\u6642\u9593\" \/>\n\t<meta name=\"twitter:data2\" content=\"2\u5206\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281\"},\"author\":{\"name\":\"admin\",\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/#\/schema\/person\/7d923d1c017568a1a5e66d7bb1c8764a\"},\"headline\":\"Kaggle\u306b\u6311\u6226\u3057\u3066\u307f\u305f\",\"datePublished\":\"2015-12-17T05:20:53+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281\"},\"wordCount\":125,\"image\":{\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle.jpg\",\"keywords\":[\"Spark\"],\"articleSection\":[\"Spark\",\"\u8457\u8005\uff08Author\uff09\"],\"inLanguage\":\"ja\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281\",\"url\":\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281\",\"name\":\"Kaggle\u306b\u6311\u6226\u3057\u3066\u307f\u305f - Tech Blog\uff5c\u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3\",\"isPartOf\":{\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle.jpg\",\"datePublished\":\"2015-12-17T05:20:53+00:00\",\"author\":{\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/#\/schema\/person\/7d923d1c017568a1a5e66d7bb1c8764a\"},\"description\":\"Spark, \u8457\u8005\uff08Author\uff09 |\u306f\u3058\u3081\u306b \u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3\u306e\u85e4\u7530\u3067\u3059\u3002\",\"breadcrumb\":{\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#breadcrumb\"},\"inLanguage\":\"ja\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"ja\",\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#primaryimage\",\"url\":\"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle.jpg\",\"contentUrl\":\"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle.jpg\",\"width\":942,\"height\":261},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"HOME\",\"item\":\"https:\/\/www.creationline.com\/tech-blog\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"\u305d\u306e\u4ed6\",\"item\":\"https:\/\/www.creationline.com\/tech-blog\/others\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Spark\",\"item\":\"https:\/\/www.creationline.com\/tech-blog\/others\/spark\"},{\"@type\":\"ListItem\",\"position\":4,\"name\":\"Kaggle\u306b\u6311\u6226\u3057\u3066\u307f\u305f\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/#website\",\"url\":\"https:\/\/www.creationline.com\/tech-blog\/\",\"name\":\"Tech Blog\uff5c\u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3\",\"description\":\"\u30a2\u30b8\u30e3\u30a4\u30eb\uff06DevOps\u3001\u30af\u30e9\u30a6\u30c9\u30cd\u30a4\u30c6\u30a3\u30d6\u3001AI\uff06LLM\u306e\u5148\u7aef\u6280\u8853\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.creationline.com\/tech-blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"ja\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/#\/schema\/person\/7d923d1c017568a1a5e66d7bb1c8764a\",\"name\":\"admin\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ja\",\"@id\":\"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2021\/12\/avatar.png\",\"url\":\"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2021\/12\/avatar.png\",\"contentUrl\":\"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2021\/12\/avatar.png\",\"caption\":\"admin\"},\"url\":\"https:\/\/www.creationline.com\/tech-blog\/author\/admin\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Kaggle\u306b\u6311\u6226\u3057\u3066\u307f\u305f - Tech Blog\uff5c\u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3","description":"Spark, \u8457\u8005\uff08Author\uff09 |\u306f\u3058\u3081\u306b \u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3\u306e\u85e4\u7530\u3067\u3059\u3002","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281","og_locale":"ja_JP","og_type":"article","og_title":"Kaggle\u306b\u6311\u6226\u3057\u3066\u307f\u305f - Tech Blog\uff5c\u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3","og_description":"Spark, \u8457\u8005\uff08Author\uff09 |\u306f\u3058\u3081\u306b \u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3\u306e\u85e4\u7530\u3067\u3059\u3002","og_url":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281","og_site_name":"Tech Blog\uff5c\u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3","article_publisher":"https:\/\/www.facebook.com\/creationline","article_published_time":"2015-12-17T05:20:53+00:00","og_image":[{"width":942,"height":261,"url":"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle.jpg","type":"image\/jpeg"}],"author":"admin","twitter_card":"summary_large_image","twitter_creator":"@creationline","twitter_site":"@creationline","twitter_misc":{"\u57f7\u7b46\u8005":"admin","\u63a8\u5b9a\u8aad\u307f\u53d6\u308a\u6642\u9593":"2\u5206"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#article","isPartOf":{"@id":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281"},"author":{"name":"admin","@id":"https:\/\/www.creationline.com\/tech-blog\/#\/schema\/person\/7d923d1c017568a1a5e66d7bb1c8764a"},"headline":"Kaggle\u306b\u6311\u6226\u3057\u3066\u307f\u305f","datePublished":"2015-12-17T05:20:53+00:00","mainEntityOfPage":{"@id":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281"},"wordCount":125,"image":{"@id":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#primaryimage"},"thumbnailUrl":"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle.jpg","keywords":["Spark"],"articleSection":["Spark","\u8457\u8005\uff08Author\uff09"],"inLanguage":"ja"},{"@type":"WebPage","@id":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281","url":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281","name":"Kaggle\u306b\u6311\u6226\u3057\u3066\u307f\u305f - Tech Blog\uff5c\u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3","isPartOf":{"@id":"https:\/\/www.creationline.com\/tech-blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#primaryimage"},"image":{"@id":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#primaryimage"},"thumbnailUrl":"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle.jpg","datePublished":"2015-12-17T05:20:53+00:00","author":{"@id":"https:\/\/www.creationline.com\/tech-blog\/#\/schema\/person\/7d923d1c017568a1a5e66d7bb1c8764a"},"description":"Spark, \u8457\u8005\uff08Author\uff09 |\u306f\u3058\u3081\u306b \u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3\u306e\u85e4\u7530\u3067\u3059\u3002","breadcrumb":{"@id":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#breadcrumb"},"inLanguage":"ja","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281"]}]},{"@type":"ImageObject","inLanguage":"ja","@id":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#primaryimage","url":"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle.jpg","contentUrl":"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2015\/12\/kaggle.jpg","width":942,"height":261},{"@type":"BreadcrumbList","@id":"https:\/\/www.creationline.com\/tech-blog\/others\/spark\/12281#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"HOME","item":"https:\/\/www.creationline.com\/tech-blog"},{"@type":"ListItem","position":2,"name":"\u305d\u306e\u4ed6","item":"https:\/\/www.creationline.com\/tech-blog\/others"},{"@type":"ListItem","position":3,"name":"Spark","item":"https:\/\/www.creationline.com\/tech-blog\/others\/spark"},{"@type":"ListItem","position":4,"name":"Kaggle\u306b\u6311\u6226\u3057\u3066\u307f\u305f"}]},{"@type":"WebSite","@id":"https:\/\/www.creationline.com\/tech-blog\/#website","url":"https:\/\/www.creationline.com\/tech-blog\/","name":"Tech Blog\uff5c\u30af\u30ea\u30a8\u30fc\u30b7\u30e7\u30f3\u30e9\u30a4\u30f3","description":"\u30a2\u30b8\u30e3\u30a4\u30eb\uff06DevOps\u3001\u30af\u30e9\u30a6\u30c9\u30cd\u30a4\u30c6\u30a3\u30d6\u3001AI\uff06LLM\u306e\u5148\u7aef\u6280\u8853","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.creationline.com\/tech-blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"ja"},{"@type":"Person","@id":"https:\/\/www.creationline.com\/tech-blog\/#\/schema\/person\/7d923d1c017568a1a5e66d7bb1c8764a","name":"admin","image":{"@type":"ImageObject","inLanguage":"ja","@id":"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2021\/12\/avatar.png","url":"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2021\/12\/avatar.png","contentUrl":"https:\/\/www.creationline.com\/tech-blog\/cms_x3GWkuX\/wp-content\/uploads\/2021\/12\/avatar.png","caption":"admin"},"url":"https:\/\/www.creationline.com\/tech-blog\/author\/admin"}]}},"_links":{"self":[{"href":"https:\/\/www.creationline.com\/tech-blog\/wp-json\/wp\/v2\/posts\/12281","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.creationline.com\/tech-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.creationline.com\/tech-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.creationline.com\/tech-blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.creationline.com\/tech-blog\/wp-json\/wp\/v2\/comments?post=12281"}],"version-history":[{"count":5,"href":"https:\/\/www.creationline.com\/tech-blog\/wp-json\/wp\/v2\/posts\/12281\/revisions"}],"predecessor-version":[{"id":12354,"href":"https:\/\/www.creationline.com\/tech-blog\/wp-json\/wp\/v2\/posts\/12281\/revisions\/12354"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.creationline.com\/tech-blog\/wp-json\/wp\/v2\/media\/12352"}],"wp:attachment":[{"href":"https:\/\/www.creationline.com\/tech-blog\/wp-json\/wp\/v2\/media?parent=12281"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.creationline.com\/tech-blog\/wp-json\/wp\/v2\/categories?post=12281"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.creationline.com\/tech-blog\/wp-json\/wp\/v2\/tags?post=12281"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}