industry-machine-learning

授权协议 Readme
开发语言 Python
所属分类 神经网络/人工智能、 机器学习/深度学习
软件类型 开源软件
地区 不详
投 递 者 督飞鸣
操作系统 跨平台
开源组织
适用人群 未知
 软件概览

Machine Learning and Data Science Applications in Industry


Sov.ai Research Lab (Sponsorship)

Animated Investment Management Research at Sov.ai — Sponsoring open source AI, Machine learning, and Data Science initiatives.


Admin

Have a look at the newly started FirmAI Medium publication where we have experts of AI in business, write about their topics of interest.

Please add your tools and notebooks to this Google Sheet. Or simply add it to this subreddit, r/datascienceproject

Highlight in YELLOW to get your package added, you can also just add it yourself with a pull request.

A curated list of applied machine learning and data science notebooks and libraries accross different industries. The code in this repository is in Python (primarily using jupyter notebooks) unless otherwise stated. The catalogue is inspired by awesome-machine-learning. r/datascienceproject is a subreddit where you can share all your data science projects.

Caution: This is a work in progress, please contribute, especially if you are a subject expert in any of the industries listed below. If you are a [analytical, computational, statistical, quantitive] researcher/analyst in field X or a field X [machine learning engineer, data scientist, modeler, programmer] then your contribution will be greatly appreciated.

If you want to contribute to this list (please do), send me a pull request or contact me @dereknow or on linkedin or get in contact on the website FirmAI.Also, a listed repository should be deprecated if:

  • Repository's owner explicitly say that "this library is not maintained".
  • Not committed for long time (2~3 years).

Help Needed: If there is any contributors out there willing to help first populate and then maintain a Python analytics section in any one of the following sub/industries, please get in contact with me. Also contact me to add additional industries.


Accommodation & Food Agriculture Banking & Insurance
Biotechnological & Life Sciences Construction & Engineering Education & Research
Emergency & Relief Finance Manufacturing
Government and Public Works Healthcare Media & Publishing
Justice, Law and Regulations Miscellaneous Accounting
Real Estate, Rental & Leasing Utilities Wholesale & Retail

Table of Contents

Industry Applications

ML/DS Career Section for Industry Machine Learning

See data-science-career repo for more.

Platforms:

  1. Triplebyte - Take a quiz. Get offers from multiple top tech companies at once (now have a machine learning track).
  2. Toptal - Developers seeking to gain entry into the Toptal community are put through a battery of personality and technical tests.
  3. Hired - Hired matches employers with qualified candidates through a combination of in-house algorithms and online support.
  4. Kaggle - Scalable Path is a premium talent matching service.

Reviews:

Accommodation & Food

Food

Restaurant

Accommodation

Accounting

Machine Learning

Analytics

  • Forensic Accounting - Collection of case studies on forensic accounting using data analysis. On the lookout for more data to practise forensic accounting, please get in touch
  • General Ledger (FirmAI) - Data processing over a general ledger as exported through an accounting system.
  • Bullet Graph (FirmAI) - Bullet graph visualisation helpful for tracking sales, commission and other performance.
  • Aged Debtors (FirmAI) - Example analysis to invetigate aged debtors.
  • Automated FS XBRL - XML Language, however, possibly port analysis into Python.

Textual Analysis

Data, Parsing and APIs

Research And Articles

  • Understanding Accounting Analytics - An article that tackles the importance of accounting analytics.
  • VLFeat - VLFeat is an open and portable library of computer vision algorithms, which has Matlab toolbox.

Websites

  • Rutgers Raw - Good digital accounting research from Rutgers.

Courses

Agriculture

Economics

  • Prices - Agricultural price prediction.
  • Prices 2 - Agricultural price prediction.
  • Yield - Agricultural analysis looking at crop yields in Ukraine.
  • Recovery - Strategic land use for agriculture and ecosystem recovery
  • MPR - Mandatory Price Reporting data from the USDA's Agricultural Marketing Service.

Development

Banking & Insurance

Consumer Finance

Management and Operation

Valuation

  • Zillow Prediction - Zillow valuation prediction as performed on Kaggle.
  • Real Estate - Predicting real estate prices from the urban environment.
  • Used Car - Used vehicle price prediction.

Fraud

Insurance and Risk

Physical

Biotechnological & Life Sciences

General

  • Programming - Python Programming for Biologists
  • Introduction DL - A Primer on Deep Learning in Genomics
  • Pose - Estimating animal poses using DL.
  • Privacy - Privacy preserving NNs for clinical data sharing.
  • Population Genetics - DL for population genetic inference.
  • Bioinformatics Course - Course materials for Computational Biologyand Bioinformatics
  • Applied Stats - Applied Statistics for High-Throughput Biology
  • Scripts - Python scripts for biologists.
  • Molecular NN - A mini-framework to build and train neural networks for molecular biology.
  • Systems Biology Simulations - Systems biology practical on writing simulators with F# and Z3
  • Cell Movement - LSTM to predict biological cell movement.
  • Deepchem - Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry, Materials Science and Biology

Sequencing

Chemoinformatics and drug discovery

  • Novel Molecules - A convolutional net that can learn features.
  • Automating Chemical Design - Generate new molecules for efficient exploration.
  • GAN drug Discovery - A method that combines generative models with reinforcement learning.
  • RL - generating compounds predicted to be active against a biological target.
  • One-shot learning - Python library that aims to make the use of machine-learning in drug discovery straightforward and convenient.

Genomics

Life-sciences

  • Plants Disease - App that detects diseases in plants using a deep learning model.
  • Leaf Identification - Identification of plants through plant leaves on the basis of their shape, color and texture.
  • Crop Analysis - An imaging library to detect and track future position of ears on maize plants
  • Seedlings - Plant Seedlings Classification from kaggle competition
  • Plant Stress - An ontology containing plant stresses; biotic and abiotic.
  • Animal Hierarchy - Package for calculating animal dominance hierarchies.
  • Animal Identification - Deep learning for animal identification.
  • Species - Big Data analysis of different species of animals
  • Animal Vocalisations - A generative network for animal vocalizations
  • Evolutionary - Evolution Strategies Tool
  • Glaciers - Educational material about glaciers.

Construction & Engineering

Construction

Engineering:

Material Science

Economics

General

Machine Learning

  • EconML - Automated Learning and Intelligence for Causation and Economics.
  • Auctions - Optimal auctions using deep learning.

Computational

Education & Research

Student

School

Emergency & Police

Preventative and Reactive

Crime

Ambulance:

  • Ambulance Analysis - An investigation of Local Government Area ambulance time variation in Victoria.
  • Site Location - Ambulance site locations.
  • Dispatching - Applying game theory and discrete event simulation to find optimal solution for ambulance dispatching
  • Ambulance Allocation - Time series analysis of ambulance dispatches in the City of San Diego.
  • Response Time - An analysis on the improvements of ambulance response time.
  • Optimal Routing - Project to find optimal routing of ambulances in Ithaca.
  • Crash Analysis - Predicting the probability of accidents on a given segment on a given time.

Disaster Management

Finance

Trading and Investment

Data

  • Datastream - Datastrem from Thomson Reuters accessible through Python.
  • AlphaVantage - API wrapper to simplify the process of acquiring free financial data.
  • FSA- A project to transfer SEC Edgar Filings’ financial data to custom financial statement analysis models.
  • TradeConnector - A layer to connect with market data providers.
  • Employee Count SEC Filings - Extraction to get the exact employee count values for companies from SEC filings.
  • SEC Parsing - NLP to find and extract specific information from long, unstructured documents
  • Open Edgar - OpenEDGAR (openedgar.io)
  • Rating Industries - Histories from multiple agencies converted to CSV format

Personal Papers

Healthcare

General

Justics, Law & Regulations

Tools

Policy and Regulatory

Judicial Applied

Manufacturing

General

Maintenance

Failure

Quality

Media & Publishing

Marketing

Miscellaneous

Art

Tourism

  • Flickr - Metadata mining tool for tourism research.
  • Fashion - A clothing retrieval and visual recommendation model for fashion images

Physics

General

Machine Learning

Government and Public Works

Social Policies

Charities

Election Analysis

Politics

  • Congressional politics - House and senate congressional partisanship.
  • Politico - A platform for profiling public figures in Brazilian politics.
  • Bots - Tools and algorithms to analyze Paraguayan Tweets in times of election
  • Gerrymander tests - Lots of metrics for quantifying gerrymandering.
  • Sentiment - Analyse newspapers with respect to their political conviction using entity sentiments of party representatives.
  • DL Politics - Prediction of Spanish Political Affinity with Deep Neural Nets: Socialist vs People's Party
  • PAC Money - Effects of PAC money on US politics.
  • Power Networks - Constructing a watchdog for Indian corporate and political networks
  • Elite - Political elite in the US.
  • Debate Analysis - Program to analyze political debates.
  • Political Affiliation - Political affiliation prediction using twitter metadata.
  • Political Ads - Investigation into Facebook Political Ads and Targeting
  • Political Identity - Multi-axial political model.
  • YT Politics - Mapping Politics on YouTube
  • Political Ideology - Unsupervised learning of political ideology by word vector projections

Real Estate, Rental & Leasing

Real Estate

  • Finding Donuts - Finding real estate opportunities by predicting transforming neighbourhoods.
  • Neighbourhood - Predicting real estate prices from the urban environment.
  • Real Estate Classification - Classifying the type of property given Real Estate, satellite and Street view Images
  • Recommender - This tools aims to recommend a user the top 5 real estate properties that matches their search.
  • House Price - Predicting house prices using Linear Regression and GBR
  • House Price Portland - Predict housing prices in Portland.
  • Zillow Prediction - Zillow valuation prediction as performed on Kaggle.
  • Real Estate - Predicting real estate prices from the urban environment.

Rental & Leasing

Utilities

Electricity

Coal, Oil & Gas

Water & Pollution

  • Safe Water - Predict health-based drinking water violations in the United States.
  • Hydrology Data - A suite of convenience functions for exploring water data in Python.
  • Water Observatory - Monitoring water levels of lakes and reservoirs using satellite imagery.
  • Water Pipelines - Using machine learning to find water pipelines in aerial images.
  • Water Modelling - Australian Water Resource Assessment (AWRA) Community Modelling System.
  • Drought Restrictions - A Los Angeles Times analysis of water usage after the state eased drought restrictions
  • Flood Prediction - Applying LSTM on river water level data
  • Sewage Overflow - Insights into the sanitary sewage overflow (SSO). - This has been removed
  • Water Accounting - Assembles water budget data for the US from existing data source
  • Air Quality Prediction - Predict air quality(aq) in Beijing and London in the next 48 hours.

Transportation

Wholesale & Retail

Wholesale

  • Customer Analysis - Wholesale customer analysis.
  • Distribution - JB wholesale distribution analysis.
  • Clustering - Unsupervised learning techniques are applied on product spending data collected for customers
  • Market Basket Analysis - Instacart public dataset to report which products are often shopped together.

Retail

Sponsors

  • Sov.ai - Animated Investment Management Research

  • 以下内容摘选自:https://github.com/ty4z2008/Qix/blob/master/dl2.md                           https://github.com/ty4z2008/Qix/blob/master/dl.md ·        《ICLR 2014论文集》 介绍:对深度学习和representationlearning最新进展有兴趣的同学

  • 全世界都在学习AI,当然我也不能例外。自动驾驶、人脸识别、遍地的机器人。。。So,今天起,我将开始着手翻译Principles of Machine Learning全书,全书共7个章节加一个导读,如果中间掺杂有实验,我也会和大家一起来完成。那么现在,让我们开始机器学习的旅程吧! Introduction Welcome to the principles of Machine Learning!

  • Introduction 1. Course Outline 1). Traditional methods: Graphlets: Graph kernels 2). Methods for node embeddings: DeepWalk, Node2Vec 3). Graph Neural Networks: GCN, GraphSAGE, GAT, Theory of GNNs 4).

 相关资料
  • 学习意味着通过学习或经验获得知识或技能。 基于此,我们可以定义机器学习(ML)如下 - 它可以被定义为计算机科学领域,更具体地说是人工智能的应用,其为计算机系统提供了学习数据和从经验改进而无需明确编程的能力。 基本上,机器学习的主要焦点是允许计算机自动学习而无需人为干预。 现在问题是如何开始和完成这种学习? 它可以从数据的观察开始。 数据可以是一些示例,指令或一些直接经验。 然后在此输入的基础上,

  • Machine Learning This project provides a web-interface,as well as a programmatic-apifor various machine learning algorithms. Supported algorithms: Support Vector Machine (SVM) Support Vector Regressio

  • machine – 与硬件相关的功能 machine 模块包含与特定开发板上的硬件相关的特定函数。 在这个模块中的大多数功能允许实现直接和不受限制地访问和控制系统上的硬件块(如CPU,定时器,总线等)。如果使用不当,会导致故障,死机,崩溃,在极端的情况下,硬件会损坏。 需要注意的是,由于不同开发板的硬件资源不同,MicroPython 移植所能控制的硬件也是不一样的。因此对于控制硬件的例程来说,在

  • Machine Learning Projects This repository contains mini projects in machine learning with jupyter notebook files.Go to the projects folder and see the readme for detailed instructions about the projec

  • Machine Learning for OpenCV This is the Jupyter notebook version of the following book: Michael Beyeler Machine Learning for OpenCV Intelligent Image Processing with Python 14 July 2017 Packt Publishi

  • Homemade Machine Learning You might be interested in �� Interactive Machine Learning Experiments For Octave/MatLab version of this repository please check machine-learning-octave project. This reposit