Tag: analytics

  • Retail Sales, Store Item Demand Time-Series Analysis/Forecasting: AutoEDA, FB Prophet, SARIMAX & Model Tuning

    This study compares and evaluates various forecasting models to predict sales and demand for retail businesses. The focus is on Time Series Analysis (TSA) methods such as FB Prophet and SARIMAX. The final FB Prophet model yields MAE=4.252 and MAPE=0.168, while the best-performing SARIMAX variant achieves MAE=6.285 and MAPE=0.213. The study emphasizes the importance…

  • A Comprehensive Analysis of Best Trading Technical Indicators w/ TA-Lib – Tesla ’23

    This study presents a comprehensive stock technical analysis guide for Tesla (TSLA) using the TA-Lib Python library. It explores the use of over 200 technical indicators, analyses historical data, and offers insight for both swing traders and long-term holders. The content includes detailed explanations and plots for various momentum, volume, volatility, and trend indicators, providing…

  • Sales Forecasting: tslearn, Random Walk, Holt-Winters, SARIMAX, GARCH, Prophet, and LSTM

    This data science project evaluates various sales forecasting algorithms in Python using a Kaggle time-series dataset. The forecasting algorithms include tslearn, Random Walk, Holt-Winters, SARIMA, GARCH, Prophet, LSTM and Di Pietro’s Model. The goal is to predict next month’s sales for a list of shops and products, which changes slightly every month. The best…

  • Prediction of NASA Turbofan Jet Engine RUL: OLS, SciKit-Learn & LSTM

    We predict the Remaining Useful Life (RUL) of NASA turbofan jet engines in Python, comparing statsmodels OLS and scikit-learn regression models against a Keras LSTM. The input is the Kaggle version of NASA’s public dataset for asset degradation modeling, which contains simulated run-to-failure data from turbofan jet engines.

  • Health Insurance Cross Sell Prediction with ML Model Tuning & Validation

    The post discusses the use of AI and Machine Learning (ML) for insurance cross-selling. It covers data preparation, model training with different algorithms, parameter optimization, and model evaluation. The study showcases the ability of ML models (HGBM, XGBoost, Random Forest) to predict cross-sell customers in the insurance sector, providing potential for improved…

  • A Balanced Mix-and-Match Time Series Forecasting: ThymeBoost, Prophet, and AutoARIMA

    The post evaluates the performance of popular Time Series Forecasting (TSF) methods, namely AutoARIMA, Facebook Prophet, and ThymeBoost on four real-world time series datasets: Air Passengers, U.S. Wholesale Price Index (WPI), BTC-USD price, and Peyton Manning. Each TSF model uses historical data to identify trends and make future predictions. Studies indicate that ThymeBoost, which combines…

  • Plotly Dash TA Stock Market App

    The post explains how to deploy a Plotly Dash stock market app in Python with a dashboard of user-defined stock prices. The dashboard includes technical indicators such as volume, MACD, and the stochastic oscillator. The steps include selecting a stock ticker symbol (NVDA), retrieving stock data from the yfinance API, adding Moving Averages, saving the stock chart in HTML form,…

  • Low-Code AutoEDA of Dutch eHealth Data in Python

    The article details the use of low-code AutoEDA in Python to examine the Dutch Healthcare Authority’s eHealth data. Using Python libraries such as D-Tale, SweetViz, and others, the study aims to understand the key features of the healthcare data and prepare it for AI techniques. The motivations include the Dutch government’s support for digital healthcare applications, especially amidst the recent…

  • Dividend-NG-BTC Diversify Big Tech

    Can Dividends, Natural Gas and Crypto Diversify Big Tech? Ultimately, we need to answer the following fundamental question: can Dividend Kings, NGUSD and BTC-USD diversify growth tech assets? Dividends are very popular among investors, especially those who want a steady stream of income from their investments. Some companies choose to share their profits…

  • Anomaly Detection using the Isolation Forest Algorithm

    The post describes the application of Isolation Forest, an unsupervised anomaly detection algorithm, to identify abnormal patterns in financial and taxi ride data. The challenge is to accurately distinguish normal and abnormal data points for fraud detection, fault diagnosis, and outlier identification. Using real-world datasets of financial transactions and NYC taxi rides, the algorithm successfully…

  • IQR-Based Log Price Volatility Ranking of Top 19 Blue Chips

    The focus is on risk assessment of top blue chips. We determine market regimes using standard deviation (STD) of log-domain stock prices.
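
As a rough illustration of the idea in this excerpt, the sketch below scores each ticker by the spread of its log prices. The title mentions an IQR-based ranking while the excerpt mentions standard deviation, so both are computed here. The function names, tickers, and prices are hypothetical and synthetic; this is not the post's actual code or data.

```python
import math
import statistics


def log_price_volatility(prices):
    """Return (std, iqr) of the natural-log prices of one ticker."""
    logs = [math.log(p) for p in prices]
    std = statistics.stdev(logs)
    # quantiles(n=4) returns the three quartile cut points Q1, Q2, Q3
    q1, _, q3 = statistics.quantiles(logs, n=4)
    return std, q3 - q1


def rank_by_iqr(price_table):
    """Sort tickers from most to least volatile by IQR of log prices."""
    scores = {t: log_price_volatility(p)[1] for t, p in price_table.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Synthetic prices for two made-up tickers (assumption, not market data)
prices = {
    "AAA": [100, 101, 99, 100, 102, 101],
    "BBB": [100, 120, 80, 130, 70, 110],
}
ranking = rank_by_iqr(prices)
print(ranking)
```

The IQR is less sensitive to isolated price spikes than the standard deviation, which may be one reason to prefer it for ranking.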

  • An Overview of Video Games in 2023: Trends, Technology, and Market Research

    The gaming industry is rapidly growing, projected to reach a revenue of $365.6 billion in 2023. Major trends include Web3 gaming, AI integration, and a push for consolidation. Fashion brands collaborate for virtual sales, and advances in gaming technology, such as AR/VR and cloud-based gaming, promise an even more immersive experience for gamers.

  • A Comparison of Automated EDA Tools in Python: Pandas-Profiling vs SweetViz

    Exploratory Data Analysis (EDA) is an important part of data science projects, designed to identify patterns, anomalies, and relationships. It can employ univariate, bivariate, and multivariate data analytics, and can be accelerated using automated EDA tools. The article discusses Python libraries such as Pandas-Profiling and SweetViz for automating EDA and demonstrates their application to improve…

  • Risk-Aware Strategies for DCA Investors

    Dollar-Cost Averaging (DCA) is an investment approach that involves investing a fixed amount regularly, regardless of market price. It offers benefits such as risk reduction and market downturn resilience. It’s useful for beginners and can be combined with other strategies for a disciplined investment approach. References include Investopedia and Yahoo Finance.
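
The fixed-amount mechanics described above can be sketched in a few lines. Investing the same amount at each price buys more shares when the price is low, so the average cost per share is the harmonic mean of the prices and never exceeds their simple average. The function name and the prices are illustrative assumptions, and fees and fractional-share limits are ignored.

```python
def dca_summary(prices, amount_per_period):
    """Summarize a DCA plan that invests a fixed amount at each price."""
    # A fixed amount buys amount / price shares in each period
    shares = sum(amount_per_period / p for p in prices)
    invested = amount_per_period * len(prices)
    return {
        "shares": shares,
        "invested": invested,
        "avg_cost": invested / shares,          # cost per share actually paid
        "avg_price": sum(prices) / len(prices)  # simple average market price
    }


# Hypothetical prices over three periods, 100 invested each time
summary = dca_summary([10.0, 5.0, 10.0], 100.0)
print(summary)
```

In this example 40 shares are bought for 300 in total, so the average cost per share is 7.5, below the simple average price of about 8.33.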

  • Working with FRED API in Python: U.S. Recession Forecast & Beyond

    The FRED (Federal Reserve Economic Data) API provides access to over 267,000 economic time series from 80 sources, offering a wealth of data to promote economic education and research. It encompasses U.S. economic and financial data, including interest rates, monetary indicators, exchange rates, and regional economic data. Additionally, we analyzed correlations, trained currency exchange prediction models,…

  • Video Game Sales Data Exploration

    The post explores the gaming industry’s size and state, highlighting a potential market value of $314bn by 2027. It emphasizes the industry’s three main subsectors: console, PC, and smartphone gaming. Moreover, the post conducts extensive data analysis on video game sales data, using Python to examine aspects such as genre profitability, platform sales prices, and…

  • Data Visualization in Python – 1. Stock Technical Indicators

    Featured Photo by Monstera on Pexels. In this project, we implement technical indicators in Python, looking at the three main conventional groups of technical indicators. Input Stock Data: let’s set the working directory VIZ (import os; os.chdir('VIZ'); os.getcwd()) and import the key libraries (import datetime as dt; import pandas as pd; import…

  • Deep Reinforcement Learning (DRL) on $MO 8.07% DIV USA Stock Data 2022-23

    This study applies a Deep Reinforcement Learning (DRL) algorithm to U.S. stocks with dividend yields above 4% in 2022-23, focusing on Altria Group, Inc. The study addresses accurate stock price prediction and the challenges of traditional methods. Recent advances in DRL have shown improved accuracy in stock forecasting, making it suitable for turbulent markets and investment decision-making.

  • LSTM Price Predictions of 4 Tech Stocks

    The post uses Exploratory Data Analysis (EDA) and a Long Short-Term Memory (LSTM) Sequential model to compare the risk/return of four major tech stocks: Apple, Google, Microsoft, and Amazon, considering the tech scenario in 2023. The analysis involves examining stock price patterns, their correlations, risk-return assessment, and predicting stock prices using…

  • SARIMAX Crude Oil Prices Forecast – 2. Brent

    This study focuses on validating the EIA energy forecast for the 2023 Brent crude oil spot price using SARIMAX time-series cross-validation. It includes prerequisites, data loading, ETS decomposition, ADF test, SARIMAX modeling, predictions, model evaluation, and summary. The predictions align with the EIA forecast, with discrepancies within predicted confidence intervals.
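
A minimal sketch of the time-series cross-validation step mentioned above, assuming an expanding training window. To stay dependency-free, a naive last-value forecaster stands in for SARIMAX; in practice a fitted SARIMAX model would be passed in as the `forecaster`. All names and the price series are hypothetical and not taken from the post.

```python
def expanding_window_splits(n_obs, initial_train, horizon):
    """Yield (train_end, test_end); train is [0:train_end], test is [train_end:test_end]."""
    train_end = initial_train
    while train_end + horizon <= n_obs:
        yield train_end, train_end + horizon
        train_end += horizon  # grow the training window by one test block


def naive_forecast(train, horizon):
    """Placeholder model (assumption): repeat the last observed value."""
    return [train[-1]] * horizon


def cross_validate(series, initial_train, horizon, forecaster=naive_forecast):
    """Return the mean absolute error (MAE) of each fold."""
    fold_mae = []
    for train_end, test_end in expanding_window_splits(len(series), initial_train, horizon):
        train, test = series[:train_end], series[train_end:test_end]
        preds = forecaster(train, horizon)
        fold_mae.append(sum(abs(p - y) for p, y in zip(preds, test)) / horizon)
    return fold_mae


# Synthetic monthly prices (assumption, not EIA or Brent data)
series = [80.0, 82.0, 81.0, 85.0, 84.0, 86.0, 88.0, 87.0]
fold_mae = cross_validate(series, initial_train=4, horizon=2)
print(fold_mae)
```

Each fold trains only on observations that precede its test block, which avoids the look-ahead leakage that shuffled k-fold splits would introduce for time series.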