بررسي ميزان مطابقت بازيابي محتوا با تصويرها و چالش هاي آن در مقاله هاي پايگاه‌هاي اطلاعاتي علمي

عنوان به زبان ديگر

Assessing Rate of Matching Content Retrieval Conformity with Images and its Challenges in Scientific Database Articles

پديد آورندگان

سليماني نژاد، عادل دانشگاه شهيد باهنر كرمان - بخش علم اطلاعات و دانش شناسي , درودي، فريبرز پژوهشگاه علوم و فناوري اطلاعات ايران ( ايرانداك ) , مرتضي پور، زينب دانشگاه شهيد باهنر كرمان

تعداد صفحه

از صفحه

173

تا صفحه

200

كليدواژه

بازيابي تصوير , بازيابي محتوا , مقاله هاي علمي , چالش هاي بازيابي تصوير , پايگاه اطلاعات علمي

چكيده فارسي

هدف از اين تحقيق بررسي ميزان مطابقت بازيابي محتوا با تصويرها و چالش‌هاي آن در مقاله‌هاي پايگاه‌هاي اطلاعاتي علمي است. روش پيمايشي توصيفي بوده و جامعه آماري آن پايگاه‌هاي اطلاعاتي علمي «ساينس دايركت»، «اسكوپوس»، «مدلاين» و «وب آو ساينس» است. ابتدا عنوان، چكيده، بيان مسئله و نتيجه‌گيري مقاله‌ها به‌صورت جداگانه وارد نرم‌افزار Extreme Picture Finderشد. سپس، تصويرهاي مرتبط با عنوان، چكيده، بيان مسئله و نتيجه‌گيري مقاله‌ها از نرم‌افزار مورد نظر استخراج گرديد. تصويرهاي استخراج‌شده وارد نرم‌افزار Visual Similarity Duplicate Image Finder گرديد تا مطابقت تصويرهاي استخراج‌شده از نرم‌افزار و تصويرهاي مقاله‌ها انجام شود. نتايج نشان داد كه بيشترين ميزان مطابقت محتوا با تصوير در پايگاه «وب‌‌آو‌‌ساينس» و كمترين ميزان مطابقت در پايگاه «ساينس ‌دايركت» وجود داشت. همچنين به‌ترتيب، بيشترين شباهت بين عنوانها، بيان مسئله، چكيده و نتيجه‌گيري با تصويرهاي مقاله‌ها در پايگاه‌ها وجود داشت. عدم رعايت استانداردهاي مصورسازي در تصويرهاي به‌كاررفته در مقاله هاي پايگاه‌هاي علمي چالشي جدي است. تصاويري كه از استاندارهاي مصورسازي (متن، رنگ، لبه، حاشيه و ...) به دور بودند، بازيابي نشدند و در صورت بازيابي از ميزان شباهت نازلي برخوردار بودند. عدم جامعيت قابليت‌هاي نرم‌افزارهاي مورد استفاده، چالش بعدي بود. مسئله ديگر عدم رعايت يا پيروي از يك شيوه‌نامه استاندارد در چيدمان تصويرها در مقاله‌هاست. برخي تصويرها بدون توجه به متن در مقاله‌ها درج شده‌اند. اين مورد در پايگاه «وب‌آو‌ساينس» كمتر، اما در ساير پايگاه هاي مورد بررسي بسيار مشاهده گرديد. با توجه به ابداع روش‌هاي جديد بازيابي تصوير و محتوا، نتيجه اين بررسي نشان مي‌دهد كه عدم شباهت بين تصويرهاي بازيابي‌شده با محتوا در مقالات پايگاه‌هاي معتبر علمي چشمگير و گمراه‌ كننده است.

چكيده لاتين

The purpose of this research is to match the content retrieval with images in the articles of scientific databases. The research method is a correlation-based procedure and the statistical population of the present study is the scientific databases such as Science Direct, Scopus, Medline and Web of Science. Six papers were extracted from each base along with a separate issue. First, the title, abstract, problem statement and conclusion of the articles were inserted into the software Etreme Picture Finder, separately. Then, the images related to title, abstract, problem statement and conclusions of the articles were extracted from the relevant software. The images extracted were inserted into the Visual Similarity Duplicate Image Finder software. The results show that the highest level of content matching was found with the images in the WebAsiensis database and the least amount of match in the database of the SinjServer. There were also the most similarities between the titles, the problem statement, the abstract and the conclusions with the images of the articles in the bases. Failure to observe the standards of visualization in the images used in scientific articles was a serious challenge. Images that have been removed from image editors (text, color, edges, margins, etc.) have not been recovered, and if recovered, they have had a fairly similar resemblance. The lack of comprehensiveness of the superficial capabilities used was the next challenge. Another problem is the non-observance of or compliance with a standard style sheet in the layout of images in the articles. Some images have been brought to the articles regardless of the text. This item was found to be smaller in the Web site, but was seen in many other sites.

سال انتشار

1398

عنوان نشريه

پژوهش نامه پردازش و مديريت اطلاعات

فايل PDF

8063001

لينک به اين مدرک

https://search.isc.ac/dl/search/defaultta.aspx?DTC=8&DC=1137898