Much of the research conducted in the psychological discipline nowadays can be termed as empirical research, whereby researchers use scientific methods, such as laboratory tests or fMRI scans, to investigate a phenomenon. The use of these methods is critical, as it provides scientists with empirical evidence that can be falsified, as opposed to non-scientific methods such as case studies and observations. Furthermore, these methods ensure that future researchers can easily replicate or revise the experiment to generate the best theory possible that applies to the human mind and behaviour. Without empirical reports, findings are unlikely to be fully trusted by other psychologists and the scientific world as a whole. However, as you will witness, despite the apparent strengths of empirical reports, problems with certain aspects of the experimentation procedure can arise. In spite of this, it is important to note that at this point in time, empirical reports are the strongest indicator of experimental effects in data and serve a vital role in the study of psychology
The strengths of empirical reports can be seen clearly in research conducted on obedience by Milgram (1963). It is an investigation that the majority of psychology students will be familiar with, as it well respected and the results are widely documented. The source of Milgram’s interest in obedience came from his observations of the behaviour of the Nazi Party in World War 11, and he began to question why members followed out the horrific tasks set by figures of authority.
To investigate his observation, Milgram recruited forty male participants who were told that the experimenters were exploring the effects of punishment on learning. The participants were paid a sum of $4.50 for their participation and were informed that they would still receive this even if they decided to withdraw from the experiment. Each participant was placed in a room with an investigator and were told that they would be the teacher for a learner who was sat in a different room; unknown to the participants, the learner was an actor. Each time the learner failed to get a question correct, the participant was asked to administer an electric shock, increasing the voltage level for each wrong answer. These were obviously fake electric shocks, but this again was unknown to the participant. The learner sat in silence for each electric shock administered until the voltage reached 300 volts, at which point the learner banged the wall and failed to answer the next question. When the voltage reached 315 volts, the learner repeated these actions, but from this moment forward, did not say or do anything. If the participant asked the experimenter if they could stop administering the electric shocks, the experimenter would have specified answers to state, such as “it is vital that you continue”. Milgram found that 65% of the participants continued to give electric shocks until the maximum voltage, which was considerably beyond the voltages marked as “danger: severe shock”. Only 12.5% of the participants ceased to give electric shocks past the 300 volts mark. The findings support the notion that people will obey authority figures, even if this means causing severe harm to others.
There are quite clearly ethical issues with this research conducted, as participants may have been left emotionally scarred with the knowledge that they could behave in such as manner towards another person. Additionally, although many experiments do require some form of deception in order to find supportive evidence, the extent of the deception used in this study did cause many raised eyebrows within the scientific community. Furthermore, despite being told at the beginning of the experiment that they would be allowed to withdraw at any time without penalty, the way the experimenter responded to participants’ pleas meant that it was difficult to do so.
Although these ethical issues are extremely important in terms of the moral implications of the experiment, the use of the empirical method is also important to analyse in order to be fully aware of the strengths and limitations in applying the findings to real life situations. One of the main problems with this research, and indeed many empirical reports published, is that the experiment was conducted in a laboratory setting, thus meaning it was extremely artificial. After all, in real life when would you ever be asked to sit in a room and give electric shocks to person who has not done anything to hurt you? This means that the experiment does lack external validity, as it is difficult to apply the findings to real life situations. The laboratory setting is also one which many people do not experience, thus again making the findings difficult to generalise. There are ways of overcoming this predicament such as conducting the experiment in different locations or setting up the laboratory to look like a normal everyday room, a classroom for example. This is often done in developmental psychology, whereby experimenters try to make a laboratory look more like a nursery to increase external validity. Moreover, the sample used were US men, thus raising the question of whether the findings from this research can be applied to the general population, perhaps in females the findings may be stronger or indeed weaker.
Another potential problem with the way that the experiment was conducted is that demand characteristics could have played a significant part of the experimental outcome. The majority of people with any ounce of common sense would probably assume that the electric shocks were fake, after all it would be illegal for them not to be. As a consequence, participants may have not taken the experiment seriously and just carried on increasing the voltage for the fun of it or to help the experimenter achieve the results that were expected. Orne and Holland (1968) further criticised the methods used stating that there may have been other reasons for participants behaviour such as assuming that the learner was no longer in the room after 315 volts. This would cause the findings to be dubious and lack internal validity.
In spite of these criticisms, there are also many strengths of the experimental method used by Milgram in order to publish an empirical report. Although the artificial setting of a laboratory can lead to problems with generalizability to real life situations, laboratory experiments are renowned for creating a substantial amount of control in an experiment, thus meaning that potential threats to internal validity are more likely to be avoided. Furthermore, perhaps the most important aspect of the amount of control and precision used in the investigation, means that it has been repeated many times across the world. Other researchers, and indeed Milgram himself, expanded the experiment to investigate other features affecting obedience. Features such as the proximity of the learner to the participant, the gender of the participant, an absent experimenter, and a shift in location, impacted on the levels of obedience to the experimenter shown by the participants. Without such as well controlled experimental method, these replications would have been difficult to achieve.
Milgram’s investigation into obedience clearly has extreme ethical issues, however for this discussion we will put these issues aside and focus on the method (although feel free to add your views on the ethical problems displayed by this and other research when commenting). As with many empirical reports, the methods used can often display certain problems that could potentially have a detrimental effect on the findings of the investigation. It seems that in most cases of experiments in laboratory settings, you have to sacrifice some external validity in order for the research to be internally valid. The fundamental question raised is what is the alternative? Would it really be realistic to use case studies or observations to go and explore such an intensive idea of obedience? The fact of the matter is that it would be almost impossible to carry out this research using a non-scientific approach. It would lack any form of empiricism, would have no internal validity whatsoever, and quite probably would not produce any significant findings. The methods used in empirical reports such as Milgram’s mean that scientists can replicate and refine important research; it means that compelling evidence is provided for others to critique, and most importantly it attempts the scientific studies of crucial difficulties in order to make the world a better place to live.
Orne, M.T., & Holland, C.H. (1968). The ecological validity of laboratory deceptions. International Journal of Psychiatry, 6. 282-293. : http://www.psych.upenn.edu/history/orne/orneholland1968ijp282293.html
Milgram, S. (1963). Behavioural study of obedience. Abnormal Social Psychology, 67. 371-378