Machine perception: Difference between revisions

From HandWiki
imported>Steve Marsio
simplify
 
linkage
 
Line 1: Line 1:
{{Short description|Computer ability}}
{{Short description|Computer ability}}
'''Machine perception''' is the capability of a computer system to interpret [[Data|data]] in a manner that is similar to the way [[Biology:Human|humans]] use their [[Philosophy:Sense|senses]] to relate to the world around them.<ref name="Tatum"/><ref name="Serov"/><ref>{{Cite web|url=http://www.ccs.fau.edu/~hahn/mpcr/|title=Machine Perception & Cognitive Robotics Laboratory|website=www.ccs.fau.edu|access-date=2016-06-18}}</ref> The basic method that the [[Computer|computers]] take in and respond to their [[Physics:Environment (systems)|environment]] is through the attached [[Engineering:Electronic hardware|hardware]].  Until recently input was limited to a keyboard, or a mouse, but advances in technology, both in hardware and [[Software|software]], have allowed computers to take in [[Biology:Sensory nervous system|sensory]] input in a way similar to humans.<ref name="Tatum"/><ref name="Serov"/>
'''Machine perception''' is the capability of a computer system to interpret [[Data|data]] in a manner that is similar to the way [[Biology:Human|human]]s use their [[Philosophy:Sense|sense]]s to relate to the world around them.<ref name="Tatum"/><ref name="Serov"/><ref>{{Cite web|url=http://www.ccs.fau.edu/~hahn/mpcr/|title=Machine Perception & Cognitive Robotics Laboratory|website=www.ccs.fau.edu|access-date=2016-06-18}}</ref> The basic method that the [[Computer|computer]]s take in and respond to their [[Physics:Environment (systems)|environment]] is through the attached [[Engineering:Electronic hardware|hardware]].  Until recently [[Input (computer science)|input]] was limited to a keyboard, or a mouse, but advances in technology, both in hardware and [[Software|software]], have allowed computers to take in [[Biology:Sensory nervous system|sensory]] input in a way similar to humans.<ref name="Tatum"/><ref name="Serov"/>


Machine [[Perception|perception]] allows the computer to use this sensory input, as well as conventional [[Computational science|computational]] means of gathering [[Information|information]], to gather information with greater accuracy and to present it in a way that is more comfortable for the [[User (computing)|user]].<ref name="Tatum"/> These include [[Computer vision|computer vision]], machine hearing, machine touch, and machine smelling, as artificial [[Biology:Sense of smell|scents]] are, at a [[Chemistry:Chemical compound|chemical compound]], [[Physics:Molecule|molecular]], [[Physics:Atom|atomic]] level, indiscernible and [[Identical particles|identical.]]<ref>{{Cite web |last=Cotton2009-03-01T00:00:00+00:00 |first=Simon |title=If it smells - it's chemistry |url=https://edu.rsc.org/feature/if-it-smells-its-chemistry/2020168.article |access-date=2022-05-03 |website=RSC Education |language=en}}</ref><ref>{{Cite web |title=Artificial networks learn to smell like the brain |url=https://news.mit.edu/2021/artificial-networks-learn-smell-like-the-brain-1018 |access-date=2022-05-03 |website=MIT News {{!}} Massachusetts Institute of Technology |language=en}}</ref>
Machine [[Perception|perception]] allows the computer to use this sensory input, as well as conventional [[Computational science|computational]] means of gathering [[Information|information]], to gather information with greater accuracy and to present it in a way that is more comfortable for the [[User (computing)|user]].<ref name="Tatum"/> These include [[Computer vision|computer vision]], machine hearing, machine touch, and machine smelling, as artificial [[Biology:Sense of smell|scents]] are, at a [[Chemistry:Chemical compound|chemical compound]], [[Physics:Molecule|molecular]], [[Atom|atom]]ic level, indiscernible and [[Identical particles|identical.]]<ref>{{cite web |last=Cotton |first=Simon |date=2009-03-01 |title=If it smells - it's chemistry |url=https://edu.rsc.org/feature/if-it-smells-its-chemistry/2020168.article |access-date=2022-05-03 |website=RSC Education }}</ref><ref>{{Cite web |title=Artificial networks learn to smell like the brain |url=https://news.mit.edu/2021/artificial-networks-learn-smell-like-the-brain-1018 |access-date=2022-05-03 |website=MIT News {{!}} Massachusetts Institute of Technology |date=18 October 2021 |language=en}}</ref>


The end goal of machine perception is to give machines the ability to [[Philosophy:Visual perception|see]], [[Philosophy:Feeling|feel]] and perceive the world as humans do and therefore for them to be able to [[Explainable artificial intelligence|explain]] in a human way why they are making their decisions, to warn us when it is failing and more importantly, the reason why it is failing.<ref>{{cite web|url=https://www.ece.vt.edu/research/area/perception|title=Machine Perception Research - ECE - Virginia Tech|website=www.ECE.VT.edu|access-date=January 10, 2018|archive-date=March 7, 2021|archive-url=https://web.archive.org/web/20210307031656/https://ece.vt.edu/research/area/perception|url-status=dead}}</ref> This [[Philosophy:Goal|purpose]] is very similar to the proposed purposes for [[Artificial intelligence|artificial intelligence]] generally, except that machine perception would only grant machines limited [[Philosophy:Sentience|sentience]], rather than bestow upon machines full [[Philosophy:Consciousness|consciousness]], [[Philosophy:Self-awareness|self-awareness]], and [[Philosophy:Intentionality|intentionality]].
The end goal of machine perception is to give machines the ability to [[Philosophy:Visual perception|see]], [[Philosophy:Feeling|feel]] and perceive the world as humans do and therefore for them to be able to [[Explainable artificial intelligence|explain]] in a human way why they are making their decisions, to warn us when it is failing and more importantly, the reason why it is failing.<ref>{{cite web|url=https://www.ece.vt.edu/research/area/perception|title=Machine Perception Research - ECE - Virginia Tech|website=www.ECE.VT.edu|access-date=January 10, 2018|archive-date=March 7, 2021|archive-url=https://web.archive.org/web/20210307031656/https://ece.vt.edu/research/area/perception}}</ref> This [[Philosophy:Goal|purpose]] is very similar to the proposed purposes for [[Artificial intelligence|artificial intelligence]] generally, except that machine perception would only grant machines limited [[Philosophy:Sentience|sentience]], rather than bestow upon machines full [[Philosophy:Consciousness|consciousness]], [[Philosophy:Self-awareness|self-awareness]], and [[Philosophy:Intentionality|intentionality]].


==Machine vision==
==Machine vision==
{{main|Machine vision}}
[[Computer vision]] is a field that includes methods for acquiring, processing, analyzing, and understanding images and high-dimensional data from the real world to produce numerical or symbolic information, e.g., in the forms of decisions.  Computer vision has many applications already in use today such as [[Facial recognition system|facial recognition]], geographical modeling, and even aesthetic judgment.<ref name="Dhar-Ordonez-Berg"/>
[[Computer vision]] is a field that includes methods for acquiring, processing, analyzing, and understanding images and high-dimensional data from the real world to produce numerical or symbolic information, e.g., in the forms of decisions.  Computer vision has many applications already in use today such as [[Facial recognition system|facial recognition]], geographical modeling, and even aesthetic judgment.<ref name="Dhar-Ordonez-Berg"/>


However, machines still struggle to interpret visual impute accurately if said impute is blurry, and if the [[Philosophy:Recognition-by-components theory#Viewpoint Variance|viewpoint]] at which stimulus are viewed varies often. Computers also struggle to determine the proper nature of some stimulus if overlapped by or seamlessly touching another stimulus. This refers to The [[Physics:Ambiguous image#Good continuation|Principle of Good Continuation]]. Machines also struggle to perceive and record stimulus functioning according to the Apparent Movement principle which [[Philosophy:Gestalt psychology|Gestalt psychologists]] researched.
However, machines still struggle to interpret visual impute accurately if it is blurry or if the [[Philosophy:Recognition-by-components theory#Viewpoint Variance|viewpoint]] at which stimuli are viewed varies often. Computers also struggle to determine the proper nature of some stimulus if overlapped by or seamlessly touching another stimulus. This refers to the [[Physics:Ambiguous image#Good continuation|Principle of Good Continuation]]. Machines also struggle to perceive and record stimulus functioning according to the Apparent Movement principle which is a field of research in [[Philosophy:Gestalt psychology|Gestalt psychology]].


==Machine hearing==
==Machine hearing==


Machine hearing, also known as machine listening or [[Computer audition|computer audition]], is the ability of a computer or machine to take in and process sound data such as speech or music.<ref name="Tanguiane1993">{{Cite book
Machine hearing, also known as machine listening or [[Computer audition|computer audition]] is the ability of a computer or machine to take in and process sound data such as speech or music.<ref name="Tanguiane1993">{{Cite book
|last=Tanguiane ([[Biography:Andranik Tangian|Tangian]])
|last=Tanguiane ([[Biography:Andranik Tangian|Tangian]])
|first=Andranick
|first=Andranick
Line 27: Line 26:
|jstor=40285634
|jstor=40285634
}}</ref>
}}</ref>
This area has a wide range of application including music recording and compression, speech synthesis, and speech recognition.<ref name="Lyon"/>
This area has a wide range of application including music recording and compression, speech synthesis and [[Speech recognition|speech recognition]].<ref name="Lyon"/>
Moreover, this technology allows the machine to replicate the human brain's ability to selectively focus on a specific sound against many other competing sounds and background noise. This particular ability is called [[Physics:Auditory scene analysis|auditory scene analysis]]. The technology enables the machine to segment several streams occurring at the same time.<ref name="Tanguiane1993" /><ref name="Tangian2001">{{Cite journal
Moreover, this technology allows the machine to replicate the human brain's ability to selectively focus on a specific sound against many other competing sounds and background noise. This ability is called "[[Physics:Auditory scene analysis|auditory scene analysis]]". The technology enables the machine to segment several streams occurring at the same time.<ref name="Tanguiane1993" /><ref name="Tangian2001">{{Cite journal
|last=Tangian |first=Andranik
|last=Tangian |first=Andranik
|year=2001  
|year=2001  
Line 37: Line 36:
|doi=10.5445/IR/1000133287|s2cid=237995668
|doi=10.5445/IR/1000133287|s2cid=237995668
}}</ref><ref>{{cite web|title=Machine Perception & Cognitive Robotics Laboratory|url=http://www.ccs.fau.edu/~hahn/mpcr/|website=ccs.FAU.edu|access-date=January 10, 2018}}</ref>
}}</ref><ref>{{cite web|title=Machine Perception & Cognitive Robotics Laboratory|url=http://www.ccs.fau.edu/~hahn/mpcr/|website=ccs.FAU.edu|access-date=January 10, 2018}}</ref>
Many commonly used devices such as a smartphones, voice translators, and cars make use of some form of machine hearing. Present technology still occasionally struggles with [[Speech segmentation|speech segmentation]] though. This means hearing words within sentences, especially when human accents are accounted for.
Many commonly used devices such as a smartphones, voice translators and cars make use of some form of machine hearing. Present technology still has challenges in [[Speech segmentation|speech segmentation]]. This means it is occasionally unable to correctly split words within sentences especially when spoken in an atypical accent.


==Machine touch{{anchor|Machine touch}}==
==Machine touch{{anchor|Machine touch}}==
[[File:SynTouch BioTac.jpg|thumb|left|A [[Engineering:Tactile sensor|tactile sensor]]]]
[[File:SynTouch BioTac.jpg|thumb|left|A [[tactile sensor]]]]
Machine touch is an area of machine perception where tactile information is processed by a machine or computer.  Applications include [[Engineering:Tactile sensor|tactile perception]] of surface properties and dexterity whereby tactile information can enable intelligent reflexes and interaction with the environment.<ref>{{cite journal|title=Learning efficient haptic shape exploration with a rigid tactile sensor array, S. Fleer, A. Moringen, R. Klatzky, H. Ritter |year=2020 |doi=10.1371/journal.pone.0226880 |pmid=31896135 |doi-access=free |last1=Fleer |first1=S. |last2=Moringen |first2=A. |last3=Klatzky |first3=R. L. |last4=Ritter |first4=H. |journal=PLOS ONE |volume=15 |issue=1 |pages=e0226880 |pmc=6940144 }}</ref> (This could possibly be done through measuring when and where friction occurs, and of what nature and intensity the friction is). Machines however still do not have any way of measuring some physical human experiences we consider ordinary, including physical pain. For example, scientists have yet to invent a mechanical substitute for the [[Biology:Nociceptor|Nociceptor]]s in the body and brain that are responsible for noticing and measuring physical human discomfort and suffering.
Machine touch is an area of machine perception where tactile information is processed by a machine or computer.  Applications include [[Engineering:Tactile sensor|tactile perception]] of surface properties and dexterity whereby tactile information can enable intelligent reflexes and interaction with the environment.<ref>{{cite journal|title=Learning efficient haptic shape exploration with a rigid tactile sensor array, S. Fleer, A. Moringen, R. Klatzky, H. Ritter |year=2020 |doi=10.1371/journal.pone.0226880 |pmid=31896135 |doi-access=free |last1=Fleer |first1=S. |last2=Moringen |first2=A. |last3=Klatzky |first3=R. L. |last4=Ritter |first4=H. |journal=PLOS ONE |volume=15 |issue=1 |article-number=e0226880 |pmc=6940144 }}</ref> Though this could possibly be done through measuring when and where friction occurs and also the nature and intensity of the friction, machines however still do not have any way of measuring few ordinary physical human experiences including physical pain. For example, scientists have yet to invent a mechanical substitute for the [[Biology:Nociceptor|Nociceptor]]s in the body and brain that are responsible for noticing and measuring physical human discomfort and suffering. Researchers are exploring ways to combine tactile sensors with machine learning algorithms to approximate human-like touch perception, enabling robots to handle delicate objects, detect texture differences, or respond adaptively to environmental changes. Emerging applications include robotic surgery , prosthetics with sensory feedback, and haptic interfaces in virtual reality, which aim to provide more immersive and precise interaction between humans and machines.


==Machine olfaction==
==Machine olfaction==
Line 71: Line 70:
* [[Engineering:Sensor|Sensor]]s
* [[Engineering:Sensor|Sensor]]s
* [[Simultaneous localization and mapping|SLAM]]
* [[Simultaneous localization and mapping|SLAM]]
*[[History of artificial intelligence|History of Artificial Intelligence]]
*[[History of artificial intelligence]]


==References==
==References==

Latest revision as of 07:33, 15 April 2026

Short description: Computer ability

Machine perception is the capability of a computer system to interpret data in a manner that is similar to the way humans use their senses to relate to the world around them.[1][2][3] The basic method that the computers take in and respond to their environment is through the attached hardware. Until recently input was limited to a keyboard, or a mouse, but advances in technology, both in hardware and software, have allowed computers to take in sensory input in a way similar to humans.[1][2]

Machine perception allows the computer to use this sensory input, as well as conventional computational means of gathering information, to gather information with greater accuracy and to present it in a way that is more comfortable for the user.[1] These include computer vision, machine hearing, machine touch, and machine smelling, as artificial scents are, at a chemical compound, molecular, atomic level, indiscernible and identical.[4][5]

The end goal of machine perception is to give machines the ability to see, feel and perceive the world as humans do and therefore for them to be able to explain in a human way why they are making their decisions, to warn us when it is failing and more importantly, the reason why it is failing.[6] This purpose is very similar to the proposed purposes for artificial intelligence generally, except that machine perception would only grant machines limited sentience, rather than bestow upon machines full consciousness, self-awareness, and intentionality.

Machine vision

Computer vision is a field that includes methods for acquiring, processing, analyzing, and understanding images and high-dimensional data from the real world to produce numerical or symbolic information, e.g., in the forms of decisions. Computer vision has many applications already in use today such as facial recognition, geographical modeling, and even aesthetic judgment.[7]

However, machines still struggle to interpret visual impute accurately if it is blurry or if the viewpoint at which stimuli are viewed varies often. Computers also struggle to determine the proper nature of some stimulus if overlapped by or seamlessly touching another stimulus. This refers to the Principle of Good Continuation. Machines also struggle to perceive and record stimulus functioning according to the Apparent Movement principle which is a field of research in Gestalt psychology.

Machine hearing

Machine hearing, also known as machine listening or computer audition is the ability of a computer or machine to take in and process sound data such as speech or music.[8][9] This area has a wide range of application including music recording and compression, speech synthesis and speech recognition.[10] Moreover, this technology allows the machine to replicate the human brain's ability to selectively focus on a specific sound against many other competing sounds and background noise. This ability is called "auditory scene analysis". The technology enables the machine to segment several streams occurring at the same time.[8][11][12] Many commonly used devices such as a smartphones, voice translators and cars make use of some form of machine hearing. Present technology still has challenges in speech segmentation. This means it is occasionally unable to correctly split words within sentences especially when spoken in an atypical accent.

Machine touch

A tactile sensor

Machine touch is an area of machine perception where tactile information is processed by a machine or computer. Applications include tactile perception of surface properties and dexterity whereby tactile information can enable intelligent reflexes and interaction with the environment.[13] Though this could possibly be done through measuring when and where friction occurs and also the nature and intensity of the friction, machines however still do not have any way of measuring few ordinary physical human experiences including physical pain. For example, scientists have yet to invent a mechanical substitute for the Nociceptors in the body and brain that are responsible for noticing and measuring physical human discomfort and suffering. Researchers are exploring ways to combine tactile sensors with machine learning algorithms to approximate human-like touch perception, enabling robots to handle delicate objects, detect texture differences, or respond adaptively to environmental changes. Emerging applications include robotic surgery , prosthetics with sensory feedback, and haptic interfaces in virtual reality, which aim to provide more immersive and precise interaction between humans and machines.

Machine olfaction

Scientists are developing computers known as machine olfaction which can recognize and measure smells as well. Airborne chemicals are sensed and classified with a device sometimes known as an electronic nose.[14][15]

Machine taste

Future

Other than those listed above, some of the future hurdles that the science of machine perception still has to overcome include, but are not limited to:

- Embodied cognition - The theory that cognition is a full body experience, and therefore can only exist, and therefore be measure and analyzed, in fullness if all required human abilities and processes are working together through a mutually aware and supportive systems network.

- The Moravec's paradox (see the link)

- The Principle of similarity - The ability young children develop to determine what family a newly introduced stimulus falls under even when the said stimulus is different from the members with which the child usually associates said family with. (An example could be a child figuring that a chihuahua is a dog and house pet rather than vermin.)

- The Unconscious inference: The natural human behavior of determining if a new stimulus is dangerous or not, what it is, and then how to relate to it without ever requiring any new conscious effort.

- The innate human ability to follow the likelihood principle in order to learn from circumstances and others over time.

- The recognition-by-components theory - being able to mentally analyze and break even complicated mechanisms into manageable parts with which to interact with. For example: A person seeing both the cup and the handle parts that make up a mug full of hot cocoa, in order to use the handle to hold the mug so as to avoid being burned.

- The free energy principle - determining long before hand how much energy one can safely delegate to being aware of things outside one's self without the loss of the needed energy one requires for sustaining their life and function satisfactorily. This allows one to become both optimally aware of the world around them self without depleting their energy so much that they experience damaging stress, decision fatigue, and/or exhaustion.

See also

References

  1. 1.0 1.1 1.2 1.3 Malcolm Tatum (October 3, 2012). "What is Machine Perception". http://www.wisegeek.com/what-is-machine-perception.htm. 
  2. 2.0 2.1 2.2 Alexander Serov (January 29, 2013). "Subjective Reality and Strong Artificial Intelligence". arXiv:1301.6359 [cs.AI].
  3. "Machine Perception & Cognitive Robotics Laboratory". http://www.ccs.fau.edu/~hahn/mpcr/. 
  4. Cotton, Simon (2009-03-01). "If it smells - it's chemistry". https://edu.rsc.org/feature/if-it-smells-its-chemistry/2020168.article. 
  5. "Artificial networks learn to smell like the brain" (in en). 18 October 2021. https://news.mit.edu/2021/artificial-networks-learn-smell-like-the-brain-1018. 
  6. "Machine Perception Research - ECE - Virginia Tech". https://www.ece.vt.edu/research/area/perception. 
  7. 7.0 7.1 Dhar, Sagnik; Ordonez, Vicente; Berg, Tamara L. (2011). "High level describable attributes for predicting aesthetics and interestingness". CVPR 2011. pp. 1657–1664. doi:10.1109/CVPR.2011.5995467. ISBN 978-1-4577-0394-2. https://dspace.sunyconnect.suny.edu/bitstream/handle/1951/55408/Dhar_grad.sunysb_0771M_10339.pdf?sequence=1. 
  8. 8.0 8.1 Tanguiane (Tangian), Andranick (1993). Artificial Perception and Music Recognition. Berlin-Heidelberg: Springer. 
  9. Tanguiane (Tangian), Andranick (1994). "Principle of correlativity of perception and its applications to music recognition". Music Perception 11 (4): 465–502. doi:10.2307/40285634. 
  10. 10.0 10.1 Lyon, Richard (2010). "Machine Hearing: An Emerging Field [Exploratory DSP". IEEE Signal Processing Magazine 27 (5): 131–139. doi:10.1109/MSP.2010.937498. Bibcode2010ISPM...27..131L. 
  11. Tangian, Andranik (2001). "How do we think: modeling interactions of memory and thinking". Cognitive Processing 2: 117–151. doi:10.5445/IR/1000133287. https://publikationen.bibliothek.kit.edu/1000133287. 
  12. "Machine Perception & Cognitive Robotics Laboratory". http://www.ccs.fau.edu/~hahn/mpcr/. 
  13. Fleer, S.; Moringen, A.; Klatzky, R. L.; Ritter, H. (2020). "Learning efficient haptic shape exploration with a rigid tactile sensor array, S. Fleer, A. Moringen, R. Klatzky, H. Ritter". PLOS ONE 15 (1). doi:10.1371/journal.pone.0226880. PMID 31896135. 
  14. "Using artificial intelligence to smell the roses: Study applies machine learning to olfaction with possible vast applications in flavors and fragrances" (in en). https://www.sciencedaily.com/releases/2020/07/200728182544.htm. 
  15. Marr, Bernard. "Artificial Intelligence Is Developing A Sense Of Smell: What Could A Digital Nose Mean In Practice?" (in en). https://www.forbes.com/sites/bernardmarr/2021/05/10/artificial-intelligence-is-developing-a-sense-of-smell-what-could-a-digital-nose-mean-in-practice/. 
  16. Turk, Matthew (2000). "Perceptive Media: Machine Perception and Human Computer Interaction". Chinese Journal of Computers 12. http://cs.ucsb.edu/~mturk/Papers/PerceptiveMedia.pdf.  pages 1235-1244