ارزيابي روش‌هاي داده‌كاوي و آمار دومتغيره در پهنه‌بندي خطر وقوع گسل‌ (مطالعه موردي: حوضه قره‌قوم)

عنوان به زبان ديگر

Evaluation of data mining and bivariate statistical methods in risk zoning of fault occurrence (Case study: Qara-Qum watershed)

پديد آورندگان

بشيري، مهدي دانشگاه تربت‌حيدريه - دانشكده كشاورزي و منابع طبيعي - گروه مهندسي طبيعت و گياهان دارويي، خراسان رضوي، ايران , كاوسي داودي، مائده شركت سامانه آب كومش، خراسان رضوي، ايران , افضلي، علي دانشگاه تهران - مركز تحقيقات بين‌المللي بيابان - گروه آموزشي مديريت مناطق بياباني، ايران

تعداد صفحه

از صفحه

1003

تا صفحه

1019

كليدواژه

ارزش اطلاعات , الگوريتم طبقه‌بندي , تراكم سطح , منحني ROC

چكيده فارسي

گسل از فرآيندهاي اصلي زمين ريختي است كه با پديده‌هايي همچون زمين‌لرزه ارتباط دارد. لذا آگاهي از اثر عوامل محيطي در بروز گسل و شناخت مناطق پرخطر اهميت زيادي دارد كه اين اهداف در پژوهش حاضر بررسي شدهاند. عرصه پژوهش حوضه قره قوم است كه پس از تعيين گسل هاي آن، متغيرهاي ارتفاع، شيب، جهت، اقليم، كاربري، ژئومورفولوژي، سنگشناسي، فرسايش، بارش، پهنه هاي سيلابي، خاك شناسي، شاخص هاي قدرت جريان، ناهمواري زمين و فاصله از جاده، آبراهه، تاقديس و ناوديس آن استخراج گرديد. سپس الگوريتم هاي داده كاوي شامل درخت تصميم، جنگل تصادفي، تجميعي بوستينگ، ماشين بردارپشتيبان، رگرسيون لوجيستيك و شبكه عصبي در نرم‌افزار R جهت شناسايي ارزش متغيرها و روش هاي آماري دومتغيره شامل ارزش اطلاعات و تراكم سطح براي شناسايي ارزش كلاس هاي هر متغير در وقوع گسل برازش شد. ارزيابي دقت طبقه بندي الگوريتم ها با منحني ROC نشان داد الگوريتم هاي جنگل تصادفي و ماشين بردارپشتيبان به ترتيب با سطح زير منحني 88 و 86 درصد بهترين عملكرد را در طبقه بندي وقوع گسل بر مبناي متغيرهاي ورودي دارند. در نهايت با ضرايب جيني در الگوريتم جنگل تصادفي، نقشههاي پهنه بندي به دست آمده از تركيب اين الگوريتم با روشهاي آماري دومتغيره،تهيه و اعتبارسنجي شد. طبق اين الگوريتم، بهترتيب متغيرهاي ارتفاع، خاك شناسي و شاخص ناهمواري زمين مهم‌ترين پارامترها در وقوع گسل شناخته شدند. طبق ارزيابي نقشه هاي پهنه بندي، در روش ارزش اطلاعات و تراكم سطح به ترتيب حدود 52 و 35 درصد گسل ها در كلاس خطر خيلي زياد واقع شدند. لذا روش ارزش اطلاعات در تعيين پهنه هاي حساس به وقوع گسل دقت بيشتري داشت. با توجه به نتايج، روش هاي دادهكاوي به عنوان ابزاري مفيد در مديريت ريسك گسل معرفي گرديد. همچنين لزوم توجه به متغيرهاي محيطي بهويژه توپوگرافي در مراحل مديريت و تغيير كاربري حوضه ضرورت مي يابد.

چكيده لاتين

Fault is one of the main processes in tectonics which has relationship with phenomena such as earthquake. Therefore, awareness of the effect of environmental factors on fault occurrence and recognition of high risk areas is very important, that these goals in the present study have been investigated. The research area is Qara-Qum where after determining its faults, altitude, slope, direction, climate, land use, geomorphology, geology, erosion, precipitation, flood zones, pedology, stream power, topographic ruggedness index, distance from road, waterway, anticline and syncline were extracted. Then, data mining algorithms including of decision tree, random forest, cumulative, backing machine, logistic regression and neural network in R software are used to identify the value of variables and bivariate statistical methods including of information value and area density for identification of the values for each variable class fitted in fault occurrence. The accuracy of classification algorithms with ROC curve showed that based on input variables, random forest and support vector machine algorithms with 88% and 86% area under a curve had the best performance in classifying fault occurrence, respectively. Finally, according to the Gini coefficients in random forest algorithm, the zoning maps obtained by combining this algorithm were prepared and validated by bivariate statistical methods. According to this algorithm, the height, pedology and topographic ruggedness index variables, identified as the most important parameters in the fault occurrence respectively. Based on the zoning maps evaluation, information value and area density methods, around 52 and 35 percent of faults placed in very high risk class respectively. Therefore, the information value method was more accurate in identifying fault-sensitive zones. According to the results, data mining methods were introduced as a useful tool in fault risk management. It is also necessary to pay attention to environmental variables, especially topography, during the basin management and land use change stages.

سال انتشار

1398

عنوان نشريه

تحقيقات مرتع و بيابان ايران

فايل PDF

7899715

لينک به اين مدرک

https://search.isc.ac/dl/search/defaultta.aspx?DTC=8&DC=1134629