Tag: Python
-
Morocco Earthquake EDA

Featured design via Canva. Clickable Table of Contents Basic Installations and Imports Let’s set the working directory YOURPATH Let’s install and import the following libraries Download Earthquake Input Data For this project, we’ll use a dataset that contains all seismic events over the last seven days, which have a magnitude of 1.0 or greater: Output:…
-
Wind Farm AEP Optimization in SciPy – Horns Rev 1 Site

The post outlined the process of optimizing the Annual Energy Production (AEP) of the Horns Rev 1 wind farm using the SciPy algorithm. The results derived from SciPy were cross-validated using other optimizers such as Optuna and PyWake. The study iterates the importance of offshore wind farms in meeting Denmark’s goals of increased electricity generation.
-
Robust Fake News Detection: NLP Algorithms for Deep Learning and Supervised ML in Python

The project aims at setting up a robust system for fake news detection using Python. The system adopts a hybrid framework, leveraging Natural Language Processing (NLP) techniques to classify text-based fake vs real news. Involving exploratory data analysis, multi-model training, testing, validation, and performance metrics comparison, it assesses different Deep Learning, Supervised Machine Learning, and…
-
Supervised ML Room Occupancy IoT

The article presents a study on applying machine learning (ML) to IoT sensor data for workspace occupancy detection. Comparing 14 popular scikit-learn classifiers, the ML systems built use the gathered IoT sensor data to predict room occupancy with high certainty. The results suggest temperature and light are the significant factors affecting occupancy detection. The study…
-
EUR/USD Forecast: Prophet vs JPM

JP Morgan (JPM) analysis predicts the EUR/USD exchange rate to hold at 1.08 in December 2023, while ING forecasts suggest rates of $1.00 throughout 2023 and $1.02 in Q1 2024, rising to $1.10 by Q4 2024. Using the FB Prophet model, predictions show a hold at 1.08 +/- 0.07 in December 2023, aligning with JPM’s…
-
OpenAI’s ChatGPT & Streamlit VA Chatbots

The project aims to develop a Virtual Assistant combining OpenAI and Streamlit to optimize their benefits. The assistant uses OpenAI’s ChatGPT to create human-like conversational dialogues. Leveraging GPT-3, a neural network model, ChatGPT formulates responses based on significant data inputs. The project demonstrates how to generate a LinkedIn post, implement a text Q&A session, create…
-
WA House Price Prediction: EDA-ML-HPO

A predictive model of house sale prices in King County, Washington, was developed using multiple supervised machine learning (ML) regression models, including LinearRegression, SGDRegressor, RandomForestRegressor, XGBRegressor, and AdaBoostRegressor. The best-performing model, XGBRegressor, explained 90.6% of the price variance, with a RMSE of $18472.7. These results, valuable to local realtors, indicate houses with a waterfront are…
-
NLP & Stock Impact of ChatGPT-Related Tweets

This Python project extends a recent study on half a million tweets about OpenAI’s language model, ChatGPT. It uncovers public sentiment about this rapidly growing app and examines its impact on the future of AI-powered LLMs, including stock influences. The project uses data analysis techniques such as text processing, sentiment analysis, identification of key influencers,…
-
ML Prediction of High/Low Video Game Hits with Data Resampling and Model Tuning

The post outlines a ML-based approach to forecast video game sales, using several techniques to enhance training, accuracy, and prediction. The Kaggle’s VGChartz dataset, containing sales data and other game-specific information, was used to build and refine the model. Several ML techniques including RandomForestClassifier and Logistic Regression yielded top predictors, with the critic’s score deemed…
-
An Overview of Video Games in 2023: Trends, Technology, and Market Research

The gaming industry is rapidly growing, projected to reach a revenue of $365.6 billion in 2023. Major trends include Web3 gaming, AI integration, and a push for consolidation. Fashion brands collaborate for virtual sales, and advances in gaming technology, such as AR/VR and cloud-based gaming, promise an even more immersive experience for gamers.
-
Customer Reviews NLP Spacy Analysis and ML/AI Demand Forecasting of the Steam PC Video Game Service

Steam, a leading digital distribution platform for PC gaming, has seen over 6000 new games released in 2022, averaging over 34 games each day. This post aims to conduct comprehensive customer reviews NLP sentiment analysis and ML/AI demand forecasting using public-domain datasets. It covers EDA, NLP Spacy analysis, ML/AI pipeline, model validation, word clouds, and…
-
Comparison of 20 ML + NLP Algorithms for SMS Spam-Ham Binary Classification

This post analyzes a public-domain SMS text message dataset to compare various machine learning algorithms’ abilities to classify spam and ham messages. After implementing a Python workflow that includes data preparation, exploratory analysis, natural language processing, supervised machine learning binary classification, and a model performance analysis, the author finds that MLP, Logistic Regression CV, Linear…
-
A Comparison of Automated EDA Tools in Python: Pandas-Profiling vs SweetViz

Exploratory Data Analysis (EDA) is an important part of data science projects, designed to identify patterns, anomalies, and relationships. It can employ univariate, bivariate, and multivariate data analytics, and can be accelerated using automated EDA tools. The article discusses Python libraries such as Pandas-Profiling and SweetViz for automating EDA and demonstrates their application to improve…
-
NLP of Restaurant Guest Reviews on Tripadvisor

This is a comprehensive study examining restaurant reviews on TripAdvisor across 31 major European cities. The research, based on a dataset scraped from TripAdvisor, aims to perform a sentiment analysis of reviews, exploring average ratings per city, vegetarian-friendly cities, and how local cuisine compares to foreign food. The analysis is carried out using Python, demonstrating…
-
Improved Multiple-Model ML/DL Credit Card Fraud Detection: F1=88% & ROC=91%

In 2023, the global card industry is projected to suffer $36.13 billion in fraud losses. This has necessitated a priority focus on enhancing credit card fraud detection by banks and financial organizations. AI-based techniques are making fraud detection easier and more accurate, with models able to recognize unusual transactions and fraud. The post discusses a…
-
Datapane Stock Screener App from Scratch

This content provides a quick guide for value investors to use the Datapane stock screener API in Python. It includes instructions for installation, importing standard libraries, setting the stock ticker, downloading stock Adj Close price, and creating visualizations. The post also describes how to build a powerful report using Datapane’s layout components.
-
Unsupervised ML, K-Means Clustering & Customer Segmentation

Table of Clickable Contents Motivation Methods Open-Source Datasets This file contains the basic information (ID, age, gender, income, and spending score) about the customers. Online retail is a transnational data set which contains all the transactions occurring between 01/12/2010 and 09/12/2011 for a UK-based and registered non-store online retail. The company mainly sells unique all-occasion…
-
Early Heart Attack Prediction using ECG Autoencoder and 19 ML/AI Models with Test Performance QC Comparisons

Table of Contents Embed Socials: ECG Autoencoder Let’s set the working directory YOURPATH import osos.chdir(‘YOURPATH’)os. getcwd() and import the following libraries import tensorflow as tfimport matplotlib.pyplot as pltimport numpy as npimport pandas as pd from tensorflow.keras import layers, lossesfrom sklearn.model_selection import train_test_splitfrom tensorflow.keras.models import Model Let’s read the input dataset df = pd.read_csv(‘ecg.csv’, header=None) Let’s…
-
Risk-Aware Strategies for DCA Investors

Dollar-Cost Averaging (DCA) is an investment approach that involves investing a fixed amount regularly, regardless of market price. It offers benefits such as risk reduction and market downturn resilience. It’s useful for beginners and can be combined with other strategies for a disciplined investment approach. References include Investopedia and Yahoo Finance.
-
GPT & DeepLake NLP: Amazon Financial Statements

The post outlines the implementation of an AI-powered chatbot using NLP to process and analyze financial data from Amazon’s financial statements. The tool employs LlamaIndex and DeepLake to answer queries, summarize financial information, and analyze trends. This approach enhances the efficiency of data analysis, making it a valuable resource for finance and banking professionals.