NumPy 或 Pandas:将数组类型保持为整数，同时具有 NaN 值

2023-03-11 Python开发问题跟版网

NumPy or Pandas: Keeping array type as integer while having a NaN value(NumPy 或 Pandas:将数组类型保持为整数，同时具有 NaN 值)

本文介绍了NumPy 或 Pandas:将数组类型保持为整数，同时具有 NaN 值的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着跟版网的小编来一起学习吧！

问题描述

是否有一种首选方法可以将 numpy 数组的数据类型固定为 int (或 int64 或其他)，同时仍然里面有一个元素列为 numpy.NaN?

Is there a preferred way to keep the data type of a numpy array fixed as int (or int64 or whatever), while still having an element inside listed as numpy.NaN?

特别是，我正在将内部数据结构转换为 Pandas DataFrame.在我们的结构中，我们有仍然有 NaN 的整数类型列(但列的 dtype 是 int).如果我们将其设为 DataFrame，似乎会将所有内容重铸为浮点数，但我们真的很想成为 int.

In particular, I am converting an in-house data structure to a Pandas DataFrame. In our structure, we have integer-type columns that still have NaN's (but the dtype of the column is int). It seems to recast everything as a float if we make this a DataFrame, but we'd really like to be int.

想法?

尝试过的事情:

我尝试使用 pandas.DataFrame 下的 from_records() 函数和 coerce_float=False 但这没有帮助.我还尝试使用 NumPy 掩码数组和 NaN fill_value，这也不起作用.所有这些都导致列数据类型变为浮点数.

I tried using the from_records() function under pandas.DataFrame, with coerce_float=False and this did not help. I also tried using NumPy masked arrays, with NaN fill_value, which also did not work. All of these caused the column data type to become a float.

推荐答案

此功能已添加到 pandas(从 0.24 版本开始):https://pandas.pydata.org/pandas-docs/version/0.24/whatsnew/v0.24.0.html#optional-integer-na-support

This capability has been added to pandas (beginning with version 0.24): https://pandas.pydata.org/pandas-docs/version/0.24/whatsnew/v0.24.0.html#optional-integer-na-support

此时，它需要使用扩展dtype Int64(大写)，而不是默认dtype int64(小写).

At this point, it requires the use of extension dtype Int64 (capitalized), rather than the default dtype int64 (lowercase).

这篇关于NumPy 或 Pandas:将数组类型保持为整数，同时具有 NaN 值的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持跟版网！

本站部分内容来源互联网,如果有图片或者内容侵犯了您的权益，请联系我们，我们会在确认后第一时间进行删除！

上一篇：如何在 PyQt5 GUI 中制作快速 matplotlib 实时绘图下一篇：如何在 Python 中将带点和逗号的字符串转换为浮点数

相关文档推荐

在xarray中按单个维度的多个坐标分组

groupby multiple coords along a single dimension in xarray(在xarray中按单个维度的多个坐标分组)

Pandas中的GROUP BY AND SUM不丢失列

Group by and Sum in Pandas without losing columns(Pandas中的GROUP BY AND SUM不丢失列)

GROUP BY+新列+基于条件的前一行抓取值

Group by + New Column + Grab value former row based on conditionals(GROUP BY+新列+基于条件的前一行抓取值)

PANDA中的Groupby算法和插值算法

Groupby and interpolate in Pandas(PANDA中的Groupby算法和插值算法)

PANAS-基于列对行进行分组，并将NaN替换为非空值

Pandas - Group Rows based on a column and replace NaN with non-null values(PANAS-基于列对行进行分组，并将NaN替换为非空值)

按10分钟间隔对 pandas 数据帧进行分组

Grouping pandas DataFrame by 10 minute intervals(按10分钟间隔对 pandas 数据帧进行分组)

栏目导航

前端开发问题 Java开发问题 C/C++开发问题 Python开发问题 C#/.NET开发问题 php开发问题移动开发问题数据库问题

最新文章

热门文章

热门标签

五金机械教育培训机械设备环保公司新闻资讯服装服饰营销型轴承电子元件零部件电子科技电子产品环保科技培训机构电子商城双语中英双语织梦模板 dede 外语学校竞价网站源码竞价培训网门户网站织梦笑话网 dedecms笑话网织梦源码网站建设搞笑图片织梦教程旅游网站源码织梦旅游网学校培训 html5 企业织梦源码医院源码后台样式移动营销页 chatgpt 整形医院大学医院新手建站客服代码洗衣机维修企业网站淘宝客导航菜单教育网站学校源码装修网站装修模板美容整形女性健康妈妈网机械源码建站公司珠宝首饰苹果网站手机资讯管理平台织梦模版打包妇科源码安卓市场源码男性时尚网健康之家 app应用网站笑话网站下载站车辆管理系统中医院网站家装网站源码