12 questions from the last 30 days
3
votes
2
answers
110
views
If I am using UTF-8 strings is it risky to use standard string handling that assumes null termination?
From what I understand it is very rare for UTF-8 strings to have embedded NULLs, however there is the case that a person can put a NULL into a Unicode string explicitly with "X\0Y" or ...
0
votes
1
answer
120
views
How to load a TStringStream from database with DBConnection set to UTF-8 charset?
We are changing the database connection settings for our project from WIN1252 to UTF8. (FireDAC).
It is a PostgreSQL database created with UTF-8 encoding.
There are some text fields that we load from ...
0
votes
2
answers
66
views
UnicodeDecodeError codec can't decode error using pandas read_csv
I'm opening a csv file using pandas.
import pandas as pd
df = pd.read_csv('/file/planned.csv')
I'm opening a file that contains about 2,000 records collected from all over the places in the world. ...
0
votes
1
answer
67
views
How do I get aspPdf to write special characters like æ, ø and å?
I am using aspPdf to convert a html page into a pdf file. And it is working fine, except from the special characters æ, ø and å.
I have this:
<!--#include file="include/connectOpen.asp"--&...
0
votes
1
answer
80
views
Convert file csv with UTF-8 format to ANSI in C#
I have .CSV files with UTF-8 encoding, and I want to convert these files to ANSI format, but my code doesn't work!
static void Main()
{
Encoding.RegisterProvider(CodePagesEncodingProvider.Instance)...
0
votes
1
answer
80
views
load data stopped prematurely
I uploaded a file with 4 million rows to a table.
The table is a database of vehicle license plates along with all the characteristics of that vehicle (color, tire size, engine capacity, and more).
...
1
vote
1
answer
71
views
How to search a utf8 string in word files using powershell [duplicate]
I created a PowerShell script with assistance from GitHub Copilot. It works well with ASCII characters, but when I try to search for UTF-8 characters, it doesn’t return any results. For example, when ...
2
votes
0
answers
46
views
Start-Transcript always inserts a NULL character (0x00) after every Japanese character
Background
I am using Start-Transcript/Stop-Transcript cmdlet to save the messages of commands launched from my PowerShell script.
Problem
I ran commands that contain Japanese characters in messages ...
0
votes
0
answers
21
views
Meta charset doesn't consistently respect French accents
Environment: html website (no cms), local repo managed with visual studio, connected to an AzureDevOps staging environment. Meta charset defined as "utf-8".
Issue: If I paste French copy ...
0
votes
0
answers
39
views
how to get generate utf-8 encoding from unicode codepoints in Private Use Area planes in python [closed]
How can I generate the hex codes of a utf-8 encoding from unicode codepoints in Private Use Area (PUA) planes?
In other planes, such as BMP, this can be done easily with chr():
for instance:
chr(107)....
-1
votes
0
answers
71
views
Fixing ‘ascii’ codec can’t encode ‘\u2014’ error in OpenAI API during vector store embedding
I am working on a RAG-related project and attempted to store loaded pages into a vector store for building a web-referenced RAG pipeline. However, I encountered the following error during the ...
0
votes
1
answer
18
views
DolphinDB: Data type-related error upon loading a CSV file into a partitioned table using
Supposing there is a CSV file containing the following columns: “Code“, “Timestamp“, “Opening Price“, “Highest Price“, “Lowest Price“, “Closing Price“, “Trading Volume“, “Trading Amount“, and “...