当前位置: 首页 > 软件库 > Web应用开发 > Web框架 >

spark-movie-lens

授权协议 View license
开发语言 Python
所属分类 Web应用开发、 Web框架
软件类型 开源软件
地区 不详
投 递 者 齐英韶
操作系统 跨平台
开源组织
适用人群 未知
 软件概览

A scalable on-line movie recommender using Spark and Flask

This Apache Spark tutorial will guide you step-by-step into how to use the MovieLens dataset to build a movie recommender using collaborative filtering with Spark's Alternating Least Saqures implementation. It is organised in two parts. The first one is about getting and parsing movies and ratings data into Spark RDDs. The second is about building and using the recommender and persisting it for later use in our on-line recommender system.

This tutorial can be used independently to build a movie recommender model based on the MovieLens dataset. Most of the code in the first part, about how to use ALS with the public MovieLens dataset, comes from my solution to one of the exercises proposed in the CS100.1x Introduction to Big Data with Apache Spark by Anthony D. Joseph on edX, that is also publicly available since 2014 at Spark Summit. Starting from there, I've added with minor modifications to use a larger dataset, then code about how to store and reload the model for later use, and finally a web service using Flask.

In any case, the use of this algorithm with this dataset is not new (you can Google about it), and this is because we put the emphasis on ending up with a usable model in an on-line environment, and how to use it in different situations. But I truly got inspired by solving the exercise proposed in that course, and I highly recommend you to take it. There you will learn not just ALS but many other Spark algorithms.

It is the second part of the tutorial the one that explains how to use Python/Flask for building a web-service on top of Spark models. By doing so, you will be able to develop a complete on-line movie recommendation service.

Part I: Building the recommender

Part II: Building and running the web service

Quick start

The file server/server.py starts a CherryPy server running aFlask app.py to start a RESTfulweb server wrapping a Spark-based engine.py context. Through its API we canperform on-line movie recommendations.

Please, refer the the second notebook for detailed instructions on how to run and use the service.

Contributing

Contributions are welcome! For bug reports or requests please submit an issue.

Contact

Feel free to contact me to discuss any issues, questions, or comments.

License

This repository contains a variety of content; some developed by Jose A. Dianes, and some from third-parties. The third-party content is distributed under the license provided by those parties.

The content developed by Jose A. Dianes is distributed under the following license:

Copyright 2016 Jose A Dianes

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

   http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
 相关资料
  • 标签 描述 video 设定影片参数 openvideo 读取影片 playvideo 播放影片 pausevideo 暂停影片 stopvideo 停止播放影片 resumevideo 播放暂停的影片 rewindvideo 倒带 videosegloop cancelvideosegloop 影片在某一个范围内循环播放 取消影片的循环播放 wp wv 等待影片周期 等待影片停止 prepare

  • Movie Vote 是一个基于 Django 开发的电影投票系统,可根据用户的投票接口来进行电影放映的排期。

  • 深度影院(英文名:Deepin Movie)是deepin 团队为开发的一款全能视频播放器。 深度影院前端使用QML开发,后端则使用QTAV进行解码,拥有简洁的外观和强劲的解码能力,对于目前常见的视频格式和4K等高清视频毫无压力。 深度影院是一款自由软件,遵循自由软件基金会发布的 GNU 通用公共许可证第三版。 特性 无边框设计,视频展现效果好 全能解码,资源占用低 支持字幕和播放进度记忆 支持流

  • Griffith 是一个电影数据收集和管理软件,你可简单的通过输入电影的名称来添加该电影,Griffith 会尝试从网络中获取该电影的详细信息。

  • React Movie Cards A basic application that displays a list of movies as a list of cards. Each card provides movie detail such as title, subtitle, description, image, and rating. Go here for live demo.

  • GIF Movie Gear 是一个专业的 GIF 动画软件,支持 APNG。