Treffer 1 - 20 von 123

2

Polars for Data Analysis in Python
Fessel, Kimberly

Machine Learning FOS: Computer and inform... Open Source Computer and information... Data Cleaning Data Science
Zu den Favoriten
3

A scalable approach for critical care data extraction and analysis in an academic medical center
Daniel Boie, Sebastian ; Meyer-Eschenbach, Falk ; Schreiber, Fabian ; et al.
In International Journal of Medical Informatics December 2024 192

Fachzeitschrift
Zu den Favoriten
4

Data Lineage Analysis for PySpark and Python ORM Libraries ; Analýza datových toků pro PySpark a ORM knihovny jazyka Python
Jurčo, Andrej ; Parízek, Pavel ; Škoda, Petr

data lineage|data flow|p... data lineage|python|symb...
Dissertation
Zu den Favoriten
5

Optimizing Big Data Processing Workflows using PySpark and Google Cloud Platform: A Performance Evaluation of Data Locality and Caching Strategies
Vaid, Kamna ; Ghose, Udayan
International Journal of Intelligent Systems and Applications in Engineering; Vol. 12 No. 15s (2024); 249-256

Predictive Analysis Predictive Models Software Bugs SVM Priority Big Data, PySpark, data...
Fachzeitschrift
Zu den Favoriten
6

Analýza datových toků pro PySpark a ORM knihovny jazyka Python
Jurčo, Andrej ; Parízek, Pavel ; Škoda, Petr

data lineage|data flow|p... data lineage|python|symb...
Dissertation
Zu den Favoriten
7

One DSL to Rule Them All: IDE-Assisted Code Generation for Agile Data Analysis
Andrzejak, Artur ; Wenz, Oliver ; Costa, Diego

Computer Science - Softw... Computer Science - Distr... Computer Science - Human... Computer Science - Progr...
Report
Zu den Favoriten
8

STL-HDL: A new hybrid network intrusion detection system for imbalanced dataset on big data environment
Al, Samed ; Dener, Murat
In Computers & Security November 2021 110

Fachzeitschrift
Zu den Favoriten
10

Sistema de recomendação e análise de dados no retalho alimentar: “Um caso real”
Mota, João Manuel Alferes Simões Vieira da

CRISP-DM Sistema de recomendação... Análise de dados -- Data... FP-growth Retalho alimentar Venda -- Sale
Zu den Favoriten
11

Running Alchemist on Cray XC and CS Series Supercomputers: Dask and PySpark Interfaces, Deployment Options, and Data Transfer Times
Rothauge, Kai ; Ayyalasomayajula, Haripriya ; Maschhoff, Kristyn J. ; et al.

Computer Science - Distr...
Report
Zu den Favoriten
12

Optimización de tiempo de ejecución con PySpark de Hadoop de un análisis de sentimientos de tweets ; Optimization of the runtime of tweet sentiment analysis with Hadoop’s PySpark
Escobar Galaburda, Daria Angélica ; Palacio Hoz, Aida ; Universidad de Cantabria

Computación distribuida Hadoop Apache Spark PySpark Python Análisis de sentimientos...
Dissertation
Zu den Favoriten
13

Chapter 20 - Overview of big data tools: Hadoop, Spark, and Kafka
In An Introduction to Healthcare Informatics 2020:291-305

Buch
Zu den Favoriten
14

Data science for pattern recognition in agricultural large time series data: A case study on sugarcane sucrose yield
Bautista-Romero, Laura Valentina ; Sánchez-Murcia, Juan David ; Ramírez-Gil, Joaquín Guillermo
In Heliyon 28 February 2025 11(4)

Fachzeitschrift
Zu den Favoriten
15

Terör Saldırılarını İçeren Büyük Verinin Makine Öğrenmesi Teknikleri ile Analizi.
ULAŞ, Mustafa ; KARABAY, Barış
Firat University Journal of Engineering Science / Fırat Üniversitesi Mühendislik Bilimleri Dergisi. 2020, Vol. 32 Issue 1, p267-277. 11p.

Fachzeitschrift
Zu den Favoriten
16

Optimización de tiempo de ejecución con PySpark de Hadoop de un análisis de sentimientos de tweets
Escobar Galaburda, Daria Angélica ; Palacio Hoz, Aida ; Universidad de Cantabria
UCrea Repositorio Abierto de la Universidad de Cantabria
Universidad de Cantabria (UC)

Análisis de sentimientos... TextBlob Apache Spark Hadoop Sentiment Analysis Computación distribuida
Dissertation
Zu den Favoriten
17

Pipeline de dados para análise epidemiológica de casos sobre transtornos mentais relacionados ao trabalho no Brasil
LUNA, Pedro Henrique Santiago de ; ALMEIDA FILHO, Adiel Teixeira de ; http://lattes.cnpq.br/9944976090960730

Python Pyspark Pandas Power BI ETL TMRT
Dissertation
Zu den Favoriten
18

Exploiting Apache Spark platform for CMS computing analytics
Meoni, Marco ; Kuznetsov, Valentin ; Menichetti, Luca ; et al.

Physics - Data Analysis,... Physics - Computational...
Report
Zu den Favoriten
19


Tolikas Sofoklis ; Tolikas Sofoklis

Zu den Favoriten
20

Data Engineering and Failure Prediction for Hard Drive S.M.A.R.T. Data
Ramanayaka Mudiyanselage, Asanga

Computer Science Machine Learning Data Engineering Python Data Analysis Big Data Predictive Analytics, Fe...
Dissertation
Zu den Favoriten

Filter