Tag: data
-
Time Series Data Imputation, Interpolation & Anomaly Detection

The post compares popular time series data imputation, interpolation, and anomaly detection methods. It explores the challenges of missing data and the impact on processing, analyzing, and model accuracy. The study performs data-centric experiments to benchmark optimal methods and highlights the importance of imputation for time series forecasting. It provides practical strategies and techniques for…
-
Uber’s Orbit Full Bayesian Time Series Forecasting & Inference

This article introduces Orbit, an open-source Python framework by Uber for full Bayesian time series forecasting and inference. It supports models like Exponential Smoothing, Local Global Trend, and Kernel Time-based Regression, along with methods like Markov-Chain Monte Carlo and Variational Inference. Orbit captures uncertainty in time-series data, allowing credible probabilistic forecasts with confidence intervals. The…
-
Practical SQL Queries, Cheat Sheets, and Interview Q&A for Data Scientists

Professionals aspiring for a career in data science must master SQL, a crucial skill. This comprehensive SQL server tutorial includes practical exercises, cheat sheets, interview Q&A tailored to data scientists, and installation requirements. From RDBMS basics to advanced concepts for data science interviews, this resource emphasizes the significance of SQL in database operations.
-
Wind Energy ML Prediction & Turbine Power Control

This text presents a detailed project on modeling the power curve of a wind turbine, which is crucial in wind energy management and forecasting. By using machine learning techniques such as Random Forest and Gradient Boosting Regressors, and validating with real-world Scada data from a Turkish wind farm, the project shows it’s possible to create…
-
EUR/USD Forecast: Prophet vs JPM

JP Morgan (JPM) analysis predicts the EUR/USD exchange rate to hold at 1.08 in December 2023, while ING forecasts suggest rates of $1.00 throughout 2023 and $1.02 in Q1 2024, rising to $1.10 by Q4 2024. Using the FB Prophet model, predictions show a hold at 1.08 +/- 0.07 in December 2023, aligning with JPM’s…
-
WA House Price Prediction: EDA-ML-HPO

A predictive model of house sale prices in King County, Washington, was developed using multiple supervised machine learning (ML) regression models, including LinearRegression, SGDRegressor, RandomForestRegressor, XGBRegressor, and AdaBoostRegressor. The best-performing model, XGBRegressor, explained 90.6% of the price variance, with a RMSE of $18472.7. These results, valuable to local realtors, indicate houses with a waterfront are…
-
NLP & Stock Impact of ChatGPT-Related Tweets

This Python project extends a recent study on half a million tweets about OpenAI’s language model, ChatGPT. It uncovers public sentiment about this rapidly growing app and examines its impact on the future of AI-powered LLMs, including stock influences. The project uses data analysis techniques such as text processing, sentiment analysis, identification of key influencers,…
-
An Overview of Video Games in 2023: Trends, Technology, and Market Research

The gaming industry is rapidly growing, projected to reach a revenue of $365.6 billion in 2023. Major trends include Web3 gaming, AI integration, and a push for consolidation. Fashion brands collaborate for virtual sales, and advances in gaming technology, such as AR/VR and cloud-based gaming, promise an even more immersive experience for gamers.
-
Overview of AWS Tech Portfolio 2023

This summary focuses on the extensive capabilities of Amazon Web Services (AWS) by 2023, highlighting its 27% year-on-year growth and a net sales increase to $127.1 billion. AWS emerges as the top cloud service provider, offering over 200 services including compute, storage, databases, networking, AI, and machine learning. It is constantly expanding operations, having opened…
-
Trending YouTube Video Data Science, NLP Predictions & Sentiment Analysis

Table of Contents Global YT WordCloud Let’s begin with the Kaggle YT TextHero dataset containing 3599 rows and 4 columns. Let’s set the working directory YOURPATH import osos.chdir(‘YOURPATH’) os. getcwd() and import all necessary modulesfrom wordcloud import WordCloud, STOPWORDSimport matplotlib.pyplot as pltimport pandas as pd Let’s read the input dataset df = pd.read_csv(r”youtube0.csv”, encoding =”latin-1″)…
