0 votes
0 answers
17 views

How to extract a specific column's information from a PDF using Flutter

I have a pdf file which has a table. Inside the table I have multiple columns. I want to extract the data from a specific column for all the entries. The pdf might contain multiple pages. I am able to ...
0 votes
0 answers
27 views

Recursively Extracting Nested Archive Files

I have 500+ archive files that contain other archive files (.zip, .rar, .7z). For example: archive.zip | |-file.abc | |-nested_archive.7z | file.def | ...
0 votes
1 answer
42 views

Extract specific dictionary value from dataframe in PySpark

I have a below dataframe dataDictionary = [('value1', [{'key': 'Fruit', 'value': 'Apple'}, {'key': 'Colour', 'value': 'White'}]), ('value2', [{'key': 'Fruit', 'value': 'Mango'}, {'...
1 vote
0 answers
75 views

Extracting audio from a video (camera trapping) and generate GUANO metadata

I have a large quantity (thousands) of video files (.MP4) generated by camera traps. I want to extract the sounds (as .WAV or any other relevant format for further analysis in an acoustic software ...
-1 votes
4 answers
71 views

API JSON javascript extract data loop array

I've succeeded at connecting and getting JSON data from API using this method: <script type="text/javascript"> fetch('https://api.web_address.com/vi/locations/10', { method: '...
0 votes
1 answer
56 views

Extract specific conditional string SQL Redshift

I want to extract only val2 from column1 of the table below. I'm not sure how to do it with SQL, though I've tried regexp_substr/instr. The length of the column values is dynamic but the contents of ...
2 votes
2 answers
10k views

How to convert qcow2 to file-system or archive file

Before I describe my problem, please go easy on me; English is not my native language. My Debian 10 on a dedicated server broke, but I have a qcow2 image. I copied the image to another server ...
1 vote
1 answer
33 views

It is not possible to extract specific columns of data in gnuplot based on a specific heading

In the following Gnuplot script, I want to extract values from a specific table. This works fine for columns that contain numbers but I cannot extract the path from the column "Measure:volume&...
18 votes
5 answers
14k views

Extract abstract / full text from scientific literature given DOI or Title

There are quite a lot of tools to extract text from PDF files[1-4]. However, the problem with most scientific papers is the difficulty of accessing the PDF directly, mostly because they have to be paid for. ...
1 vote
1 answer
70 views

How to convert a human-readable timeline to table using existing ML tools?

I have this timeline from a newspaper produced by my Native American tribe. I was trying to use AWS Textract to produce some kind of table from this. AWS Textract does not recognize any tables in this....
4 votes
2 answers
24k views

PostgreSQL Extract() DOW : why not Date datatype in addition to timestamp?

According to the PostgreSQL version 8.1 date-time function docs: dow The day of the week (0 - 6; Sunday is 0) (for timestamp values only) SELECT EXTRACT(DOW FROM TIMESTAMP '2001-02-16 20:38:40'); ...
23 votes
5 answers
88k views

How to extract the source code from a *.jar file on a Mac?

I'm very confused. I downloaded a *.jar file as a bit of software, and I would like to extract the source code to look at it. I used the command jar xf filename.jar which returned two more *.jar ...
2 votes
5 answers
100 views

Splitting one column to three columns for uneven characters in r

I have tried to split a single column to three columns. But I failed. I have the following data set > dat name Jhon Austin B 100kg Mick Gray C 110kg Tom Jef A 30kg First I tried to extract ...
0 votes
1 answer
37 views

How to Extract Emails from .pst Files Using Python?

I have a .pst file of Outlook emails from which I want to extract the emails. What approach should I follow to extract the emails from it? I'm writing Python code for it. data extraction from .pst ...
1 vote
2 answers
84 views

How can I scrape the data within the "informations détaillées" section of this URL : https://gallica.bnf.fr/ark:/12148/cb42768809f/date?

Hi guys, I am brand new to web scraping. I am trying to scrape the data within the "informations détaillées" section of this webpage (https://gallica.bnf.fr/ark:/12148/cb42768809f/...
1 vote
2 answers
708 views

Data extraction from HEC-RAS

I'm using HEC-RAS for 2D unsteady modeling of a river delta. My model is simulated for one year. I need to extract the velocities and/or discharges and compare them with the velocities from an already ...
0 votes
0 answers
30 views

How to Extract Audio From BGM.DAT file?

I know that the music is contained in the file called BGM.DAT and the file is .3gb, so online converters are no use because of their file size limitations. How do I extract the audio files from the ....
1 vote
2 answers
5k views

Power Automate Regex expression

I need a Power Automate expression to extract a substring from a string, based on the regex AA-BBB-\d{1,6} but apparently Power Automate doesn't offer a regex match/extract function via expressions. ...
2 votes
3 answers
156 views

In Excel, how to extract and convert unsorted comma-separated data from one column into multiple columns?

In Excel, I have a table where rows correspond to the respondents of a survey and columns correspond to the respondents' answers to some questions. When asked what the respondents grow on their fields,...
1 vote
3 answers
14k views

convert XML to table in oracle

I have XML in the format below, stored in an XMLType column in an Oracle database: <a> <c>1</c> <c>2</c> </a> I need to convert this to table format as c 1 2 Any ...
0 votes
1 answer
52 views

How to Convert a PDF Table with Thousands of Rows into JSON in React

I am working on a project where I need to convert a PDF containing a large table (thousands of rows) into a JSON array of objects. The PDF has a table with headers that should be used as keys in the ...
0 votes
1 answer
31 views

extract vector charts from PDF

I can't find a way to isolate or extract vector charts and graphs (that are not images) from a PDF. I have tried extracting them directly, but I realize that it is not that straightforward. I was ...
0 votes
0 answers
27 views

Textract extract 2 columns of QUANTITY type data. What would be the best way to detect which is the good one?

I'm using AWS Textract with AnalyzeExpense to extract data from P.O.s. But sometimes a P.O. has both "QTY ORDERED" and "QTY SHIPPED", and both are detected as QUANTITY. So I end up with 2 ...
-3 votes
2 answers
56 views

How to extract items from a list in Excel 2019

There is a list of values (numbers only, not text) in a document (for example from B3 to B14). Values may appear in different cells in the list and not always in all of these cells of the list. There may ...
0 votes
2 answers
624 views

How to extract a picture from a PDF file

I want to extract pictures from PDF files using C++, but I don't understand the picture format in PDF files. Can someone help me? I looked at the content of PDF files by opening them with Notepad, I tried ...
5 votes
3 answers
7k views

How to extract column names from SQL query using Python

I would like to extract the column names of a resulting table directly from the SQL statement: query = """ select sales.order_id as id, p.product_name, sum(p.price) as ...
0 votes
1 answer
44 views

Preserve xy coordinates while using terra::extract with vector data

I have a list of xy coordinates (originally from a raster) that I want to associate with their political unit (e.g., point a is in New York, point b is in Vermont). Associating a point with a separate ...
5 votes
1 answer
10k views

extract or unzip files in gitlab using ci/cd command line

I have my zip files on GitLab and I want to extract those files using GitLab CI/CD. I have tried this in .gitlab-ci.yml: image: docker stages: - build - test services: - docker:dind ...
0 votes
1 answer
1k views

extracting text from a pdf with pdftotext

I am trying to extract text from a pdf with python and pdftotext. I have some weird characters and I don't know what's wrong. For example, for this PDF at page 4: It gives me this: There is a ...
0 votes
1 answer
167 views

Data extract from Youtube Channel

I got the code to extract the list of the videos with titles and their links from YouTube. It works with videos only but not for the shorts. The codes are as: Code 1: var scroll = setInterval(...
0 votes
0 answers
30 views

Is the pdfjs-dist external library compatible with Chrome extensions, and if so, what are the steps to add it to a plugin?

I want to extract data from a PDF file in my plugin and use the data for information. I am trying with pdfjs-dist, but it breaks my gulp-building process, showing pdf-dist as an external dependency. ...
1 vote
1 answer
51 views

How to extract text associated with image from pdf?

I am using pymupdf to extract images from PDF. Code sample is as below. import pymupdf doc = pymupdf.open('sample.pdf') page = doc[0] # get the page image_list = page.get_images() page_index = 0 ...
2 votes
2 answers
127 views

Regex exact match

I have the following sentence: "The size of the lunch box is around 1.5l or 1500ml" How can I change this to: "The size of the lunch box is around 1.5 liter or 1500 milliliter" In ...
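
A minimal sketch of one way to make that replacement, assuming Python (the excerpt is cut off before it names a language) and assuming only the units `l` and `ml` need expanding. The names `UNIT_NAMES`, `PATTERN`, and `expand_units` are hypothetical and chosen for illustration only.

```python
import re

# Assumed mapping from unit abbreviation to the spelled-out unit.
UNIT_NAMES = {"ml": "milliliter", "l": "liter"}

# A number (integer or decimal), optional whitespace, then "ml" or "l".
# "ml" is listed first so it is tried before the shorter "l", and the
# trailing \b keeps longer words such as "1.5lbs" from matching.
PATTERN = re.compile(r"(\d+(?:\.\d+)?)\s*(ml|l)\b")


def expand_units(text: str) -> str:
    """Replace '<number>l' / '<number>ml' with '<number> liter' / '<number> milliliter'."""
    return PATTERN.sub(lambda m: f"{m.group(1)} {UNIT_NAMES[m.group(2)]}", text)


if __name__ == "__main__":
    print(expand_units("The size of the lunch box is around 1.5l or 1500ml"))
```

Anchoring the unit to a preceding number, rather than replacing every bare `l` or `ml`, is what keeps ordinary words in the sentence untouched.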
0 votes
1 answer
1k views

Python requests: Extracting data from payload

Currently I am looking into the Python requests module. I am trying to automate a task for which I need to pass on a csrf token. The csrf token can be found in the payload of a previous request. How ...
-1 votes
1 answer
874 views

Fine-Tuning Large Language Model on PDFs containing Text and Images [closed]

I need to fine-tune an LLM on a custom dataset that includes both text and images extracted from PDFs. For the text part, I've successfully extracted the entire text data and used the OpenAI API to ...
0 votes
1 answer
38 views

Decoding X32 Behringer mixer response

I would like help getting the 4 Return meter values (post gain/trim, gate, dyn gain reduction and post-fade) from the X32 Behringer mixer. These are 32-bit floats, little-endian coded. to ...
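
A minimal sketch of decoding four little-endian 32-bit floats, assuming Python and assuming the 16 bytes holding the four values have already been isolated from the mixer's response. How the X32 frames its reply is not shown in the excerpt, so that step is left out. The function name `decode_four_floats` is hypothetical.

```python
import struct


def decode_four_floats(raw: bytes) -> tuple:
    """Decode the first 16 bytes of raw as four little-endian 32-bit floats."""
    if len(raw) < 16:
        raise ValueError("need at least 16 bytes for four 32-bit floats")
    # "<" selects little-endian byte order, "4f" means four 4-byte floats.
    return struct.unpack("<4f", raw[:16])


if __name__ == "__main__":
    # Synthetic bytes for illustration only, not real mixer output.
    sample = struct.pack("<4f", 0.5, 1.0, 0.25, 0.0)
    print(decode_four_floats(sample))
```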
0 votes
0 answers
11 views

Conditional Data Extraction from Nested JSON Using Recursive Search Fails to Return Non-Empty Specific Values

I'm working on a Python script to extract specific values from (several 100,000) deeply nested JSON files. While my script correctly extracts most of the desired fields using recursive searching, it ...
-1 votes
3 answers
287 views

Accessing data from a JSON file

I am quite new to JSON. My code extracts data from a website which requires an API key. Having extracted the information, I am trying to reach the information which is encoded in JSON ...
2 votes
2 answers
1k views

In Flutter, how to extract frame by frame from an animated gif?

I'm trying to extract frames from an animated gif using Flutter. By researching the Flutter API, I thought the following code should work but it only gives me the first frame although it gives me the ...
3 votes
3 answers
27k views

extract a date from a cell containing text string in excel

For the Twelve Months Ending Thursday, June 30, 2016 I have the above text in excel and I need an excel formula to extract ONLY the date June 30, 2016. The formulas I've researched were all for ...
0 votes
2 answers
782 views

Extracting subtitle from a teachable/hotmart video

After many hours, I found a way to extract the subtitle text from a teachable/hotmart video in a half-automatic way. Is there any way to automate the process? The problem for a more sophisticated ...
0 votes
1 answer
43 views

How can I extract specific values from a .csv-File and add them into a specific cell in a pre-existing dataframe/tibble in R automatically?

I want to automatically extract specific values from a .csv-File, which is generated by our measuring device, into a dataframe/tibble in R which has a pre-defined layout. The name of the measured ...
1 vote
1 answer
325 views

Extract 301 Redirect URL from Google Sheets - Script Editor [duplicate]

I've been looking for a way to improve an awesome Google Sheets Script I found on StackOverflow: function getStatusCode(url){ var options = { 'muteHttpExceptions': true, 'followRedirects': ...
0 votes
2 answers
58 views

Extract an item by name from a MapType() column in a PySpark column

I have a PySpark dataframe structured like this, where the array is formatted to start with a number: 1: "item / state / zip" most, but not all of the time. Sometimes the order will be ...
5 votes
3 answers
4k views

Extract meta description field with goquery

I'm using the goquery package to extract pieces of information from a webpage. Please see my code below. The outcome after running the function is: Description field: text/html; charset=iso-...
0 votes
3 answers
75 views

Need Excel VBA program that will extract two $numbers from column A and place them in Col B&C

I found some Python code on this site via a Google search and adapted it for Excel VBA, but it doesn't quite work. Column A has multiple rows with two numbers. The numbers always follow a dollar sign ...
3 votes
2 answers
454 views

pdfplumber only extracting the first row of data of every page in 300+ page document

I'm trying to extract all the rows of data from the following pdf: https://www.seguridadaerea.gob.es/sites/default/files/aeronaves_inscritas.pdf I've written a Python script that uses the pdfplumber ...
0 votes
0 answers
167 views

Extracting images from a table column in a PDF and saving them to Excel

When using pdfplumber, I can't extract the images from a specific table column. I use the following packages to get the images: import fitz # PyMuPDF from PIL import Image I need to use pdfplumber ...
0 votes
0 answers
22 views

While running Feature Extraction for VIT model, I am getting Resource Exhausted Error

While running Feature Extraction for VIT model for 2000 images, I am getting Resource Exhausted Error. We are using Google Colab Pro plan. We selected Compute Machine as T4 GPU. We find that GPU RAM ...
2 votes
2 answers
14k views

How can I extract data from a .h5 file and save it in .txt or .csv properly?

After searching a lot I couldn't find a simple way to extract data from .h5 and pass it to a DataFrame with NumPy or pandas in order to save it in a .txt or .csv file. import h5py import numpy as np import ...