Question
January 15, 2009 04:12 AM
How can I get a list of random ASIN numbers?
An ASIN is the unique identifier that Amazon uses for its products. I would like a way to collect a bunch of random ASINs for analysis. I tried generating random strings, but valid ASINs are sparse enough that brute-force guess-and-check isn't practical.
No Best Answer Selected
Answers (1)
January 15, 2009 04:22 AM
How about a quick crawler/scraper? It's not hard to write code that fetches a web page, finds the links (<a href> tags), picks out the ones whose format marks them as product links, and parses the ASIN out of each. Then repeat on a random qualifying link from each page. The results won't be truly "random", since linked products are likely to be related to one another, but it's a start. Run the crawler long enough and you'll have a sample big enough that you can then sample *that*, and it'll feel more random.
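Here is a rough sketch of that loop in Python, in case it helps. It makes a few assumptions that are mine and not from Amazon's documentation: that product links contain "/dp/" or "/gp/product/" followed by a 10-character ID of uppercase letters and digits, and that you supply your own `fetch` function (hypothetical name) that takes a URL and returns the page's HTML as a string.

```python
import random
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

# Assumption: a product link has /dp/ or /gp/product/ followed by a 10-character ID.
ASIN_RE = re.compile(r"/(?:dp|gp/product)/([A-Z0-9]{10})(?![A-Z0-9])")


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def product_links(html, base_url):
    """Return absolute URLs of the links that look like product links."""
    parser = LinkParser()
    parser.feed(html)
    absolute = (urljoin(base_url, href) for href in parser.links)
    return [url for url in absolute if ASIN_RE.search(url)]


def crawl(start_url, fetch, steps, rng=random):
    """Random walk: record the ASINs linked from each page, then follow one at random."""
    seen = set()
    url = start_url
    for _ in range(steps):
        links = product_links(fetch(url), url)
        if not links:
            break  # dead end: no product links on this page
        seen.update(ASIN_RE.search(link).group(1) for link in links)
        url = rng.choice(links)
    return seen
```

Once the walk has collected a large pool, something like `random.sample(sorted(pool), 100)` would draw the second-stage sample. A real `fetch` should also pause between requests and respect the site's terms.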
If you've got any shell skills, get yourself a copy of the "wget" program, which automatically pulls down HTML pages to your file system. Once you've got a page local (one hit to Amazon, so they're unlikely to even notice), use the language of your choice to break it up and analyze it (I prefer Ruby). Part of what you'll find in any given page is more links, so when you need more to analyze, just recursively fetch more pages.
http://en.wikipedia.org/wiki/Wget
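For the "break it up and analyze it" step, a minimal sketch in Python (the answer prefers Ruby; Python is used here only for illustration). It assumes the pages have already been saved under some directory, for example by wget's recursive mode (`-r`), and that product links contain "/dp/" or "/gp/product/" followed by a 10-character ID of uppercase letters and digits. Both the link pattern and the function name `asins_in_dir` are assumptions of this sketch.

```python
import re
from pathlib import Path

# Assumption: product links look like /dp/<ID> or /gp/product/<ID>, with a 10-character ID.
ASIN_IN_LINK = re.compile(r"/(?:dp|gp/product)/([A-Z0-9]{10})(?![A-Z0-9])")


def asins_in_dir(root):
    """Scan every file under root and return the set of ASIN-like IDs found in links."""
    found = set()
    for path in Path(root).rglob("*"):
        if path.is_file():
            # Saved pages may not be clean UTF-8, so undecodable bytes are skipped.
            text = path.read_text(encoding="utf-8", errors="ignore")
            found.update(ASIN_IN_LINK.findall(text))
    return found
```

Calling `asins_in_dir("downloaded_pages")` (a hypothetical directory name) returns a set, so an ID that appears on many pages is counted once.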