攻击行为估计---基于神经网络的人体分割与行为识别研究综述(IJEME-V9-N1-2)

I.J.EducationandManagementEngineering,2019,1,9-19PublishedOnlineJanuary2019inMECS()DOI:10.5815/ijeme.2019.01.02Availableonlineat:AComprehensiveReviewonNeuralNetworkBasedHumanSegmentationandActionRecognitionA.F.M.SaifuddinSaifa,Md.AkibShahriarKhana,AbirMohammadHadia,RahulPrashadKarmokera,JoyJulianGomesaaFacultyofScienceandTechnology,AmericanInternationalUniversity–Bangladesh(AIUB),Dhaka,BangladeshReceived:11October2018;Accepted:17December2018;Published:08January2019AbstractHumanactionrecognitionhasbeenatalkedtopicsincemachinevisionwascoined.Withtheadventofneuralnetworksanddeeplearningmethods,variousarchitecturesweresuggestedtoaddresstheproblemswithinacontext.Convolutionalneuralnetworkhasbeentheprimarygo-toarchitectureforimagesegmentation,flowestimationandactionrecognitioninrecentdays.Astheproblemitselfisanextendedversionofvarioussub-problems,suchasframesegmentation,spatialandtemporalfeatureextraction,motionmodelingandactionclassificationasawhole,somemethodsreviewedinthispaperaddressedsub-problemsandsometriedtoaddressasinglearchitecturetotheactionrecognitionproblem.Whilebeingasuccess,convolutionneuralnetworkshavedrawbacksinitspoolingmethods.CapsNet,ontheotherhand,usessquashingfunctiontodeterminetheactivation.AlsoitaddressesspatiotemporalinformationwiththenormalizedvectormapswhileCNN-basedmethodsextractsfeaturemapforspatialandtemporalinformationandlateraugmenttheminafusionlayerforcombiningtwoseparatefeaturemaps.Criticalreviewofpapersprovidedinthisworkcancontributesignificantlyinaddressinghumanactionrecognitionproblemasawhole.IndexTerms:CapsuleNetwork,NeuralNetwork,ImageSegmentation,FlowEstimation,ActionRecognition.©2019PublishedbyMECSPublisher.Selectionand/orpeerreviewunderresponsibilityoftheResearchAssociationofModernEducationandComputerScience.1.IntroductionActionrecognitioninvideoscanhavearadicalimpactonhumanlife.Numerousattemptshavebeentakentosolvetheactionrecognitionchallenges.Duetohugecollaborativeeffortsincomputervisioncommunity,*Correspondingauthor:E-mailaddress:saif@aiub.edu,akeeebkhan@gmail.com,abir45pro@gmail.com,karmoker.rahul4@gmail.com,joyjuliangomes@gmail.com10AggressiveActionEstimation:AComprehensiveReviewonNeuralNetworkBasedHumanSegmentationandActionRecognitionsimpleactionsofwaving,standingetc.fromKTHandWeizmanndatasetarenowconsideredasobsoletechallengesandthecommunityhasmovedontosolvemorecomplexactionslikesportsandhumaninteractions.However,despitehavingthepotentialtoimprovesecurityandsurveillanceapplications,therehasnotbeenmuchimprovementinregardtotheviolentsceneandaggressivebehaviordetectionwhichisaspecialcaseofactionrecognition.Previousworksonactionrecognitionheavilyreliedontheusageofhard-codedtechniquessuchasMoSIFT,OpticalFlowandDenseTrajectory.Thesehard-codedtechniquesarecomputationallyexpensivewhileofferinglowperformance.InrecenttimesafterthesuccessofAlexNet,awaveofworksapproachedtheproblemfromanewviewpointusingconvolutionalneuralnetworks.Thoughbeingincisiveinimageclassificationtasks,ConvolutionalNeuralNetworksdidnotfarewellimmediatelyagainstalreadyestablishedmethodsinactionrecognition.DifferenttypesoffusiontechniquesusingbothdensefeaturesandCNNimprovesperformance.Two-streamnetworksand3D-CNNusingmotionfeaturessuchasopticalflowandRNNinconjunctionwiththeaforementionedtechniquesalsogaveaboostinperformance.Buttheseapproacheshavesomeseveredisadvantageslikemax-poolingwhichinmostcasessuppressestinybutimportantfeaturesand,susceptibletoadversarialattacks.Thoughthesemethodswork,theydonotprovideanyinsightintohowtheinnermechanismfunctions.ThenewlyproposedCapsNetarchitecturecanhelptobridgethegapasthisparticularsystemfollowsapart-to-wholeapproachandproducesvectoroutputs,unlikeCNNwhichhasscalaroutputs.Capsulesareparticularlygoodathandlingdifferenttypesofvisualstimulusandencodingthingslikepose(position,size,andorientation),deformation,velocity,albedo,hue,textureetc.thatisnotpossibleforCNN.Capsulesencapsulateall-importantinformationaboutthestateofthefeaturetheyaredetectinginvectorform.Therestofthepaperisorganizedasfollows.Section2discussesthechallengesofactionrecognitionandprovidesaconciseviewofabroadrangeoftechnologiesandapproachesthatareusedtosolvetheproblem.Insection3numerousmethodsrelatedtoactionrecognitionarereviewed.Section4elaboratestheframeworksusedinthemethoddescribedinsection3.Section5providesdetailsontheexperimentalsettingsandperformanceofthemethods.Insection6keyfindingsfromthemethodsaresummarized.Section7concludesthepaperemphasizingtheimpactoftheproblem.2.CoreBackgroundStudyHumanactionrecognitionisanintegralprobleminspatiotemporalinformationextraction,fusion,learninganddetectionfromvideostreams,bothinstaticandespeciallyinalivefeedanalysis.Numerousstudieshavebeenconductedbasedonhard-codedfeatureextraction,poseestimation,frameanddynamicsandalsoasneuralnetworklearningproblem.Theproblemindiscussionisaddressedbysub-problemsthatinclude:framepreprocessing(ifany),featureextraction(bothspatialandtemporal),learningthefeat

攻击行为估计---基于神经网络的人体分割与行为识别研究综述(IJEME-V9-N1-2)

免费阅读已结束，点击付费阅读剩下 ... 页

阅读已结束，您可以下载文档离线阅读

OPPO A100产品教材

Powerpoint2003基础教程(已修改)

增强版AVRmega16与mega32开发板使用手册

摄影比赛电子评统

全国XXXX年1月自学考试机械制造试题

家居装修全过程十二要素

第13讲ppt-上海理工大学—光电信息与计算机工程学院

准许在检验检疫系统内使用的卫生处理药品器械名单doc-序

酒店档案借阅单

“社会学”含义、起源、历史与发展

相关文档

相关搜索