Next Question
RSS
It seems to me you would have to proceed on a per question basis. While Darcy and random's answer of dividing by the number of responses given by a respondent would be what you would want to do if you were asking about amount of time spent on the bus, your question asked "What do you typically use the bus for?"
This question does not care if someone rides the bus twice a month, once for work and once for school, or if they ride it to school in the morning and work in the afternoon every single day. So dividing to try to normalize data here would not get you useful information because the question was not asking about a quantity.
In this case, if you have few enough answers, a venn diagram would be an ideal way to present this information. Lets say that A = work B = school and C = shopping. The data that you want to show is the number of people in the group A, B and C as well as the people in the intersection of AB, BC, AC and ABC. If you have more than 3 a venn diagram is no longer the best way to present the data, but you should still be interested in the sets of people who are in A, B, C, D... and the intersection of those sets. (Intersections being the people who are in both A and B because they answered they typically use it for work and school)
You will find that later on when you are trying to draw conclusions based on this data that having this breakdown will be more useful than having a breakdown where the sets were normalized by dividing out the multiple answers.
Someone who typically uses the bus for work should be fully counted, for example, when trying to draw conclusions about if workplace reimbursement for bus passes helps increase ridership. They should also be fully counted for drawing conclusions about the benefits to merchants at the mall if they also use the bus for shopping. These two things are not in any way exclusive. Almost all people who shop, work. It would not make sense to try to normalize this.
However, if you did want to normalize it, lets say you had groups A, B and C (A being all people who answered work, even if they also answered school, be being people who answered school even if they also answered work...) and the subsets AB, AC, BC, ABC (people who gave more than one answer) and you want to normalize the A group. The equation would be this:
A - (AB + AC) + ABC + (AC - ABC)/2 + (AB - ABC)/2 + ABC/3
If you are using a program like SPSS to do your computations, it can do stuff like this automatically. Otherwise it would be fairly easy to set up an Excel document to do it for you as well.
Source(s):
Oregon State Stats 314 and Math 231 :P
Permalink | Report
Let me give you an example of why (but since I don't feel like getting out my calculator I am going to pretend that there is a survey of exactly 100 people and only 2 reasons to take the bus).
Say there are 100 busriders surveyed, and 50 of them say "I take the bus to get to work" as their only answer, and 20 say "I take the bus to go shopping" as their only answer, and 30 say "I take the bus to go shopping and to get to work."
What % of bus-riders take the bus to go to work? According to your survey, 80%. If you say, 65% (50 + 0.5*30), most people would think that is misleading. 80 people out of 100 said for work. Your audience probably would call this 80%.
Permalink | Report
Answered Question
M$5
January 14, 2009 03:31 AM
How should I process the answers from a recent survey I conducted?
In a project for my local bus transit department, I recently conducted a survey of bus riders and their use of mobile technology.
Although I requested each responder to provide the BEST answer for each question, multiple answers were given on some questions. A total of 229 documents were submitted. In summarizing the survey I am unsure how I should proceed with the questions that received more than one answer. For example, when asking "What do you typically use the bus for?" multiple answers were given totaling 356 responses as opposed to the 229.
Clearly, as I calculate percentages, I will get two different numbers for each option presented (work, shopping, school, etc...).
Should I use 229 across the board or should I calculate each questions numbers and base the percentages separately? Which method would give me the most accurate answers to my survey? Is there an alternate way you would suggest approaching this?
Thanks for your assistance.
Although I requested each responder to provide the BEST answer for each question, multiple answers were given on some questions. A total of 229 documents were submitted. In summarizing the survey I am unsure how I should proceed with the questions that received more than one answer. For example, when asking "What do you typically use the bus for?" multiple answers were given totaling 356 responses as opposed to the 229.
Clearly, as I calculate percentages, I will get two different numbers for each option presented (work, shopping, school, etc...).
Should I use 229 across the board or should I calculate each questions numbers and base the percentages separately? Which method would give me the most accurate answers to my survey? Is there an alternate way you would suggest approaching this?
Thanks for your assistance.
Interesting Question?
Yes (0)
No (0)
- In Public Transportation |
- |
- Report |
-
Share
RSS
Best Answer Chosen by Asker
| January 14, 2009 05:58 PM |
This question does not care if someone rides the bus twice a month, once for work and once for school, or if they ride it to school in the morning and work in the afternoon every single day. So dividing to try to normalize data here would not get you useful information because the question was not asking about a quantity.
In this case, if you have few enough answers, a venn diagram would be an ideal way to present this information. Lets say that A = work B = school and C = shopping. The data that you want to show is the number of people in the group A, B and C as well as the people in the intersection of AB, BC, AC and ABC. If you have more than 3 a venn diagram is no longer the best way to present the data, but you should still be interested in the sets of people who are in A, B, C, D... and the intersection of those sets. (Intersections being the people who are in both A and B because they answered they typically use it for work and school)
You will find that later on when you are trying to draw conclusions based on this data that having this breakdown will be more useful than having a breakdown where the sets were normalized by dividing out the multiple answers.
Someone who typically uses the bus for work should be fully counted, for example, when trying to draw conclusions about if workplace reimbursement for bus passes helps increase ridership. They should also be fully counted for drawing conclusions about the benefits to merchants at the mall if they also use the bus for shopping. These two things are not in any way exclusive. Almost all people who shop, work. It would not make sense to try to normalize this.
However, if you did want to normalize it, lets say you had groups A, B and C (A being all people who answered work, even if they also answered school, be being people who answered school even if they also answered work...) and the subsets AB, AC, BC, ABC (people who gave more than one answer) and you want to normalize the A group. The equation would be this:
A - (AB + AC) + ABC + (AC - ABC)/2 + (AB - ABC)/2 + ABC/3
If you are using a program like SPSS to do your computations, it can do stuff like this automatically. Otherwise it would be fairly easy to set up an Excel document to do it for you as well.
Source(s):
Oregon State Stats 314 and Math 231 :P
Permalink | Report
Other Answers (4)
January 14, 2009 03:56 AM
Yes, I see.
It will be tedious to go through each survey again and find the corrections but it would provide a 1 for 1 response.
Report
It will be tedious to go through each survey again and find the corrections but it would provide a 1 for 1 response.
January 14, 2009 09:46 AM
You should use 229 as the number; you should also make clear when reporting the statistics that respondants had the option of providing multiple answers (which explains why the percentages add up to more than 100). Let me give you an example of why (but since I don't feel like getting out my calculator I am going to pretend that there is a survey of exactly 100 people and only 2 reasons to take the bus).
Say there are 100 busriders surveyed, and 50 of them say "I take the bus to get to work" as their only answer, and 20 say "I take the bus to go shopping" as their only answer, and 30 say "I take the bus to go shopping and to get to work."
What % of bus-riders take the bus to go to work? According to your survey, 80%. If you say, 65% (50 + 0.5*30), most people would think that is misleading. 80 people out of 100 said for work. Your audience probably would call this 80%.
Permalink | Report
Answer this Question
Related Questions
Ask a Question
Buy Mahalo Dollars with Credit Card or PayPal
Top Members
Most Popular Tags
Categories
- Anonymous
- Arts & Design
- Beauty & Style
- Books & Authors
- Business
- Cars & Transportation
- Consumer Electronics
- Coupons Deals
- Education
- Entertainment
- Environment
- Fitness
- Food & Drink
- From Email
- From Iphone
- From Twitter
- Health
- History
- Hobbies
- Home & Garden
- How Tos
- Humor
- Jobs
- Legal
- Local
- Love & Relationships
- Mahalo Answers Community
- Money
- Music
- News
- NSFW
- Parenting
- Pets
- Science & Mathematics
- Services
- Shopping
- Social Science
- Society & Culture
- Sports
- Technology & Internet
- Travel
- Video Games
Welcome New Members
- boujame, November 29, 2009 10:05 PM
- conundrum_gunsl..., November 29, 2009 10:05 PM
- luck1ng, November 29, 2009 10:04 PM
- adelineromaille..., November 29, 2009 10:02 PM
- nicegirl42, November 29, 2009 09:43 PM
Mahalo Dollars are the currency of Mahalo Answers.
Each Mahalo Dollar costs $1.
Once you earn more than 40 Mahalo Dollars, you can request to be paid via PayPal. Each Mahalo Dollar is currently worth $0.75 when paid out via PayPal. Learn More
Thanks for the detailed answer.