<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>performance on David An</title>
    <link>https://davidan.dev/tags/performance/</link>
    <description>Recent content in performance on David An</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>en-us</language>
    <lastBuildDate>Tue, 17 Jan 2023 00:00:00 +0000</lastBuildDate><atom:link href="https://davidan.dev/tags/performance/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>A Discussion on Pandas and Data Mining</title>
      <link>https://davidan.dev/posts/datamining/</link>
      <pubDate>Tue, 17 Jan 2023 00:00:00 +0000</pubDate>
      
      <guid>https://davidan.dev/posts/datamining/</guid>
      <description>At the beginning of any data analytics/data science project, the most usual case is utilizing Pandas to load the data into a object called a DataFrame and perform preprocessing tasks on it. While the Pandas library is convenient and comes with a trove of useful analytic tools, it does have some inefficiencies, many due to the nature of Python. To investigate this, let&amp;rsquo;s look at how Pandas actually works on top of Python.</description>
    </item>
    
  </channel>
</rss>
