Tag: regression
-
Python Data Science for Real Estate & REIT Amsterdam: (Auto) EDA, NLP, Maps & ML

The Amsterdam real estate market has experienced a significant resurgence, with property prices increasing by double digits annually since 2013. Data science is being used to analyze the city’s housing and rental markets, revealing insights on the impact of Airbnb and empowering communities with the necessary information. Comprehensive data analysis and machine learning techniques are…
-
Time Series Data Imputation, Interpolation & Anomaly Detection

The post compares popular time series data imputation, interpolation, and anomaly detection methods. It explores the challenges of missing data and their impact on processing, analysis, and model accuracy. The study performs data-centric experiments to benchmark optimal methods and highlights the importance of imputation for time series forecasting. It provides practical strategies and techniques for…
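As a rough illustration of the kind of methods the post compares, the sketch below fills gaps by linear interpolation and flags outliers with a plain z-score rule. It is a minimal NumPy sketch, not the post's own code: the function names, the toy series, and the threshold of 3 are all assumptions made for illustration.

```python
import numpy as np

def interpolate_gaps(values):
    """Fill NaN gaps by linear interpolation between the nearest observed points."""
    y = np.asarray(values, dtype=float)
    idx = np.arange(y.size)
    known = ~np.isnan(y)
    # np.interp holds the first/last observed value constant outside the observed range
    return np.interp(idx, idx[known], y[known])

def zscore_anomalies(values, threshold=3.0):
    """Return a boolean mask of points whose |z-score| exceeds the threshold."""
    y = np.asarray(values, dtype=float)
    std = y.std()
    if std == 0:
        # A constant series has no outliers under this rule
        return np.zeros(y.shape, dtype=bool)
    return np.abs((y - y.mean()) / std) > threshold

# Hypothetical toy series with missing observations
series = [1.0, np.nan, 3.0, np.nan, np.nan, 6.0]
filled = interpolate_gaps(series)

# Hypothetical series with one obvious spike at the end
flags = zscore_anomalies([10.0] * 20 + [100.0])
```

Linear interpolation is only one baseline; it assumes the series moves smoothly across a gap, which can be a poor fit for seasonal or bursty data.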
-
Leveraging Predictive Uncertainties of Time Series Forecasting Models

Table of Contents: Introduction; Random Simulation Tests; TSLA Stock 43 Days; TSLA Stock 300 Days; Housing in the United States; Industrial Production; Federal Funds Rate Data; S&P 500 Absolute Returns; Number of Airline Passengers (1. Holt-Winters); Number of Airline Passengers (2. Prophet); Average Temperature in India; Monthly Sales Data Analysis QC…
-
A Balanced Mix-and-Match Time Series Forecasting: ThymeBoost, Prophet, and AutoARIMA

The post evaluates the performance of popular Time Series Forecasting (TSF) methods, namely AutoARIMA, Facebook Prophet, and ThymeBoost, on four real-world time series datasets: Air Passengers, U.S. Wholesale Price Index (WPI), BTC-USD price, and Peyton Manning Wikipedia page views. Each TSF model uses historical data to identify trends and make future predictions. Studies indicate that ThymeBoost, which combines…
-
WA House Price Prediction: EDA-ML-HPO

A predictive model of house sale prices in King County, Washington, was developed using multiple supervised machine learning (ML) regression models, including LinearRegression, SGDRegressor, RandomForestRegressor, XGBRegressor, and AdaBoostRegressor. The best-performing model, XGBRegressor, explained 90.6% of the price variance, with an RMSE of $18,472.7. These results, valuable to local realtors, indicate houses with a waterfront are…
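The two quality measures quoted here can be computed directly from predictions: "explained 90.6% of the price variance" corresponds to a coefficient of determination R² of 0.906, and the RMSE is the root of the mean squared error in dollars. The sketch below shows both formulas with NumPy; the four prices and predictions are hypothetical and are not taken from the King County data.

```python
import numpy as np

def rmse(y_true, y_pred):
    """Root mean squared error, in the same units as the target."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

def r2_score(y_true, y_pred):
    """Fraction of target variance explained: 1 - SS_res / SS_tot."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return float(1.0 - ss_res / ss_tot)

# Hypothetical sale prices and model predictions (illustration only)
prices = [300_000, 450_000, 520_000, 610_000]
predicted = [310_000, 440_000, 500_000, 630_000]

print(f"RMSE: ${rmse(prices, predicted):,.1f}")
print(f"R^2:  {r2_score(prices, predicted):.3f}")
```

The two numbers answer different questions: RMSE gives the typical error size in dollars, while R² is unitless and depends on how spread out the prices are.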
-
Dabl Auto EDA-ML

Dabl, short for Data Analysis Baseline Library, is a high-level data exploration library in Python that automates repetitive data wrangling tasks in the early stages of supervised machine learning model development. Developed by Andreas Mueller and the scikit-learn community, it facilitates data preprocessing, advanced integrated visualization, exploratory data analysis (EDA), and ML model development, demonstrated…
-
Breast Cancer ML Classification – Logistic Regression vs Gradient Boosting with Hyperparameter Optimization (HPO)

Breast cancer (BC) is a leading cause of cancer death among women worldwide. The present study optimizes the use of supervised Machine Learning (ML) algorithms for detecting, analyzing, and classifying BC. We compare Logistic Regression (LR) against Gradient Boosting (GB) Classifier within the Hyperparameter Optimization (HPO) loop given by GridSearchCV. We use the publicly available BC dataset…
-
ANOVA-OLS Prediction of Surgical Volumes
Operating rooms (ORs) are some of the most valuable hospital assets, generating a large part of hospital revenue. Statistical models have been developed using datasets to predict daily surgical volumes weeks in advance. We focus on the VUMC dataset for evaluation of our statistical models. We use the ANOVA null-hypothesis test for the total number…
-
ML/AI Regression for Stock Prediction – AAPL Use Case
1. Install the Yahoo Finance library
2. Import all dependencies used in this exercise
3. Define the ticker you will use
4. Look at the data table
5. Data Exploration Phase
6. Data Preparation, Pre-Processing & Manipulation
7. Apply Linear Regression
8. Perform ML QC Analysis
9. Final Output
-
ML/AI Regression for Stock Prediction – AAPL Use Case
The following steps apply ML/AI regression to stock price prediction. The objective is to model the available historical stock prices of $AAPL using the scikit-learn library.
1. Install the Yahoo Finance library: !pip install yfinance
2. Import all dependencies used in this exercise: import pandas as pd import…
-
Supervised Machine Learning Use Case: Prediction of House Prices
This is an application of supervised machine learning to real estate. The goal is to predict sale prices ($) for N selected properties in a state (N ≫ 1000). We are given a CSV dataset as an N×M table, where M is the number of property features describing every aspect of the house and its surroundings (typically, M < 100). …
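To make the N×M setup concrete, here is a minimal sketch that fits an ordinary least squares model to a synthetic feature table and scores it on held-out rows. Everything in it is an assumption for illustration: the data is randomly generated rather than read from the post's CSV, the sizes N and M are arbitrary, and plain least squares via NumPy stands in for whatever models the post actually uses.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for the N x M property table (hypothetical sizes)
N, M = 2000, 5
X = rng.normal(size=(N, M))
true_w = np.array([50_000.0, 30_000.0, -10_000.0, 5_000.0, 0.0])
y = 300_000.0 + X @ true_w + rng.normal(scale=1_000.0, size=N)

# Hold out the last 20% of rows for evaluation
split = int(0.8 * N)
X_train, X_test = X[:split], X[split:]
y_train, y_test = y[:split], y[split:]

def add_intercept(features):
    """Prepend a column of ones so the model learns a base price."""
    return np.column_stack([np.ones(len(features)), features])

# Ordinary least squares: minimize ||A w - y||^2 over w
coef, *_ = np.linalg.lstsq(add_intercept(X_train), y_train, rcond=None)

# Predict held-out prices and measure the error in dollars
y_pred = add_intercept(X_test) @ coef
test_rmse = float(np.sqrt(np.mean((y_test - y_pred) ** 2)))
print(f"Held-out RMSE: ${test_rmse:,.0f}")
```

Scoring on rows the model never saw is the important part of the design: an error measured on the training rows would understate how far off predictions are for new properties.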