问题：

ApachePOISAX解析-如何获取单元格的实际值

长孙淳

2023-03-14

我需要使用内存有限的ApachePOI解析非常大的excel文件。在谷歌搜索之后，我知道poi提供了SAX解析器来有效地解析大文件，而不需要消耗大量内存。

apachepoisax解析器示例

  private class SheetToCSV implements SheetContentsHandler {
    private boolean firstCellOfRow = false;
    private int currentRow = -1;
    private int currentCol = -1;

    private void outputMissingRows(int number) {
        for (int i=0; i<number; i++) {
            for (int j=0; j<minColumns; j++) {
                output.append(',');
            }
            output.append('\n');
        }
    }

    @Override
    public void startRow(int rowNum) {
        // If there were gaps, output the missing rows
        outputMissingRows(rowNum-currentRow-1);
        // Prepare for this row
        firstCellOfRow = true;
        currentRow = rowNum;
        currentCol = -1;
    }

    @Override
    public void endRow(int rowNum) {
        // Ensure the minimum number of columns
        for (int i=currentCol; i<minColumns; i++) {
            output.append(',');
        }
        output.append('\n');
    }

    @Override
    public void cell(String cellReference, String formattedValue,
            XSSFComment comment) {
        if (firstCellOfRow) {
            firstCellOfRow = false;
        } else {
            output.append(',');
        }

        // gracefully handle missing CellRef here in a similar way as XSSFCell does
        if(cellReference == null) {
            cellReference = new CellAddress(currentRow, currentCol).formatAsString();
        }

        // Did we miss any cells?
        int thisCol = (new CellReference(cellReference)).getCol();
        int missedCols = thisCol - currentCol - 1;
        for (int i=0; i<missedCols; i++) {
            output.append(',');
        }
        currentCol = thisCol;

        // Number or string?
        try {
            Double.parseDouble(formattedValue);
            output.append(formattedValue);
        } catch (NumberFormatException e) {
            output.append('"');
            output.append(formattedValue);
            output.append('"');
        }
    }

    @Override
    public void headerFooter(String text, boolean isHeader, String tagName) {
        // Skip, no headers or footers in CSV
    }
}

在上面链接中提供的示例中，方法“cell”只能访问格式化值，但我需要访问单元格的实际值。

共有1个答案

鲁涵意

2023-03-14

流媒体接口的当前实现不提供此功能。因此，为了实现这一点，您需要复制底层XSSFSheetXMLHandler的代码，并对其进行调整，以使单元格内容不被格式化。

类似资料：

如何获取空单元格apache POI的单元格样式

有人能给我一个解决方案，我可以在单元格为空时读取应用到单元格的样式吗？谢了。
如何从DataFrame的单元格中获取值？

我构造了一个从数据帧中提取一行条件：现在我想从一个特定列中取一个值：
如何从数据框的单元格获取值？

问题内容：我构造了一个条件，可以从我的数据帧中准确提取一行：现在，我想从特定列中获取一个值：但是结果是我得到了一个包含一行和一列（即一个单元格）的数据框。这不是我所需要的。我需要一个值（一个浮点数）。我该怎么做在熊猫里？问题答案：如果您的DataFrame仅包含一行，则使用，作为Series访问第一行（唯一），然后使用列名访问值：
如何获取Apache POI中单元格的数字格式值？

问题内容：我想以一个简单的字符串来获取数值单元格的值。假设单元格的类型是带有value的数字。现在我无法使用，因为它将引发异常。我也无法使用，因为它会使我退缩。我想存储在类型为varchar2的db中，所以我只希望字符串中的值。我无法更改单元格类型，因为它是最终用户的工作，我必须在代码本身中进行处理。格式化程序也无法正常工作，因为xls … dd：mm，dd：mm：ss，公式等中可能存在
Excel VBA：获取单元格的值，与单元格的格式无关

null 如您所料，A3将导致。现在将A2的格式更改为会计，使用小数点后2位。A2现在读，但是基础值仍然是，所以A3仍然是。 VBA 制作一个新模块并添加以下函数： null 和具有相同的基础值，但和没有，尽管它们都是使用和的方法计算的。 ()中的表达式正在访问和的实际基础值。如何在VBA中访问这些值？
如何从数据帧的单元格中获取值？

我构建了一个条件，从我的数据框中提取一行: 现在我想从特定列中获取一个值：但结果，我得到了一个包含一行和一列（即一个单元格）的数据帧。这不是我需要的。我需要一个值（一个浮点数）。我怎样才能在熊猫身上做到这一点？

ApachePOISAX解析-如何获取单元格的实际值

共有1个答案

相关问答

相关文章

相关阅读

相关工具

相关文档