Big Data Tools on Amazon Product Reviews Dataset


Data Mining on Amazon Product Reviews Dataset. This work was part of the final project for the Computational Tools for Big Data course offered by DTU, A.Y. 2016/17. By leveraging big data technologies such as Apache Spark, Neo4j, Pandas DataFrames, SQL, the overall goal was to exploit the potential of such tools to carry out an extensive analysis on an inconveniently large dataset, with all the drawbacks that kick in when it comes to storing, handling and processing the data.

Links


About Riccardo

Hey! Thank you for taking the time to check out my contents and I really hope you enjoyed them. My name is Riccardo, I'm the content creator of this website and the developer who built it. I'm a Software Engineer currently based in Copenhagen, Denmark. Please feel free to reach out for any questions or feedback. You may leave a comment below, send me a message or connect through the social media links you find in this website.