arXiv Analytics

Sign in

arXiv:cs/0007026 [cs.LG]AbstractReferencesReviewsResources

Integrating E-Commerce and Data Mining: Architecture and Challenges

Suhail Ansari, Ron Kohavi, Llew Mason, Zijian Zheng

Published 2000-07-14Version 1

We show that the e-commerce domain can provide all the right ingredients for successful data mining and claim that it is a killer domain for data mining. We describe an integrated architecture, based on our expe-rience at Blue Martini Software, for supporting this integration. The architecture can dramatically reduce the pre-processing, cleaning, and data understanding effort often documented to take 80% of the time in knowledge discovery projects. We emphasize the need for data collection at the application server layer (not the web server) in order to support logging of data and metadata that is essential to the discovery process. We describe the data transformation bridges required from the transaction processing systems and customer event streams (e.g., clickstreams) to the data warehouse. We detail the mining workbench, which needs to provide multiple views of the data through reporting, data mining algorithms, visualization, and OLAP. We con-clude with a set of challenges.

Comments: KDD workshop: WebKDD 2000
Journal: WEBKDD'2000 workshop: Web Mining for E-Commerce -- Challenges and Opportunities
Categories: cs.LG, cs.AI, cs.CV, cs.DB
Subjects: I.2.6, H.2.8
Related articles: Most relevant | Search more
arXiv:1711.04708 [cs.LG] (Published 2017-11-13)
Machine Learning for the Geosciences: Challenges and Opportunities
arXiv:1901.00248 [cs.LG] (Published 2019-01-02)
A Survey on Multi-output Learning
arXiv:2308.06780 [cs.LG] (Published 2023-08-13)
Neural Networks at a Fraction with Pruned Quaternions