ETL with Python Training - Taught during Data Warehousing course - Tel Aviv University 2017
jupyter-notebook.ipynb - quick Jupyter notebook introduction and tutorial
Mysql-io.ipynb - Input/Output to MySQL using the MySQLdb connector
ETL with Python.ipynb - ETL with Python using the petl package
Extra: CSV-io - csv library usage examples
drinks.json - drinks consumption data (source)
drinks.zip - zipped json file (used for a zip file example in ETL notebook)
The mysql-io exercise assumes you have MySQL's default built-in world schema. If you do not, you can create it by running CreateWorld.sql, a dump of the SQL script that creates the world schema.
file1.csv/file2.csv/file3.csv - CSV examples for the csv library
simpsons_phone_book.csv - csv example for sniffer
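
For readers who want a feel for the sniffer example before opening the notebook, here is a minimal sketch using csv.Sniffer from the standard library. It is written for Python 3, and it does not reproduce the notebook's code. The sample rows, the semicolon delimiter, and the helper name read_with_sniffer are assumptions made up for illustration; the real contents of simpsons_phone_book.csv may differ.

```python
import csv
import io

# Made-up sample data (an assumption, NOT the contents of simpsons_phone_book.csv).
SAMPLE = (
    "name;phone\n"
    "Homer Simpson;555-0101\n"
    "Marge Simpson;555-0102\n"
)


def read_with_sniffer(text, candidates=";,\t"):
    """Guess the delimiter of CSV text, then parse it into a list of dicts.

    `candidates` limits the characters the sniffer may pick as the delimiter,
    which makes the guess more predictable on small samples.
    """
    dialect = csv.Sniffer().sniff(text, delimiters=candidates)
    reader = csv.DictReader(io.StringIO(text), dialect=dialect)
    return dialect.delimiter, list(reader)


delimiter, rows = read_with_sniffer(SAMPLE)
print("detected delimiter:", repr(delimiter))
for row in rows:
    print(row["name"], row["phone"])
```

With a file on disk, the same idea applies: read the first few kilobytes as the sample, sniff the dialect, seek back to the start, and pass the dialect to csv.reader or csv.DictReader.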