2,260 questions with no upvoted or accepted answers
9
votes
1
answer
588
views
Extract adaptive launcher icon from Android apk as svg/png?
Is there a way to extract the launcher icon of an apk file, even if it is an adaptive icon (vector drawable)?
I couldn't find a single batch or python script to achieve this.
There are many scripts ...
8
votes
1
answer
580
views
Extract JSON-LD from HTML using Apache Any23
My aim is to extract structured data from webpages. I'm using the code mentioned in this SO question. I'm using Apache Any23 CLI library dependency in my Spring project.
By using this, I'm able to ...
7
votes
0
answers
15k
views
How to extract charts/tables/graphs from PDF files using Python?
Searched quite a bit but as I couldn't find a solution for this kind of problem, hence posting a clear question on the same. Most answers cover image/text extraction which are comparatively easier.
I'...
7
votes
1
answer
8k
views
passing a stringstream to istream using operator >>
I am trying to pass a stringstream into an object(class) that has an overloaded extraction operator >> that is declared and defined. For example, the declaration for the overloaded extraction ...
6
votes
0
answers
3k
views
How to scrape tweets after July 2023?
Since Twitter's API is now impracticable for research and scrappers like SnScrape are not working because of the "login block" on the Twitter's search engine, there is a way to extract ...
6
votes
0
answers
4k
views
Extract all video frame from mp4 video using OpenCV and C++
I'm following a tutorial to extract video frames. I've read this question, it doesn't work, also queationfrom Open CV Answer, but the solution is for capturing current frame. I have a 120fps video and ...
5
votes
1
answer
10k
views
extract or unzip files in gitlab using ci/cd command line
Actually, I have my zip files on gitlab, I want to extract those files using gitlab CI/CD.I have tried this in .gitlab-ci.yml:
image: docker
stages:
- build
- test
services:
- docker:dind
...
5
votes
0
answers
5k
views
Reading DWG file in Python and extracting edge points
I have a DWG file in which I have a rectangle with several lines in it (e.g. floor plan with interior walls). How can I use Python to extract the edges (X,Y Coordinates)? I need to extract the floor ...
5
votes
0
answers
990
views
Tika 1.1 Performance Improvement
I am using tika 1.1, I am facing issue that tika is taking long time for extracting the content from file.
For extracting 1MB of pdf/doc file it taking time around ~3Second.
Is there any way to ...
5
votes
0
answers
5k
views
How can I extract all the keyframes from an MP4 video using ffmpeg?
I would like to extract all keyframes from a video in BMP format for further processing. I managed to investigate this command:
ffmpeg -skip_frame nokey -i videofile.mp4 -vf select='eq(pict_type\,I),...
4
votes
0
answers
2k
views
how can i extract game resources made by Unity3d?
I operated according to the tutorial of the following websitethis is the tutorial, but found that the contents of the file were different from those in the tutorial. As a result, when I extracted the ...
4
votes
0
answers
552
views
Angular i18n - Reverse extract translations to source code?
I've built an Angular application which obviously contains UI copy: labels for buttons, messages for errors, hints for form validation, etc. These texts are directly placed in the source code by me, a ...
4
votes
0
answers
1k
views
How to extract react js code from a website like google chrome dev tool does?
I would like to know how Google Chrome can display me the entire react source code of a frontend application when I visit a React website.
For example If I open a website I have this in the source tab ...
4
votes
1
answer
972
views
How to extract hyperlinks from office documents using tika
I'm using Apache Tika to extract raw text from various document formats including office.
When extracting text from word documents that include hyperlinks, then only the text is extracted and the ...
4
votes
0
answers
1k
views
Elasticsearch Keyword Extraction
I have a corpus of ~10K articles. For each article I would like to extract keywords (tags). So for every article I would like a ranking of the tokenized terms in the article based on their frequency ...
4
votes
0
answers
693
views
How to get Youtube Start, Middle, End, x Sec Thumbnail in maximum resolution
I was wondering whether the start, middle and end thumbnails are also available in max resolution?
As descriped in "How do I get a YouTube video thumbnail from the YouTube API" the default thumbnail ...
4
votes
1
answer
3k
views
Oracle Query to get data from more complex XML clob
I have a fairly complex clob stuffed with xml in oracle that i would like to parse in SQL query like I have used in the past with Extract and ExtractValue.
Name &...
4
votes
0
answers
1k
views
Is it possible to capture/save JPEGs from an MJPEG stream? (and also the MJPEG itself)?
I am using MJPEG-Streamer to stream an MJPEG video file over http, similar to how this guy does it. MJPEG-Streamer basically creates a streaming server on the streaming device which hosts the MJPEG ...
4
votes
0
answers
2k
views
Flatten vector graphics inside pdf and extract using java
I am trying to get sizes (width and depth) of images embedded in a PDF file. The images in the PDF are all high resolution vector images.
I tried using PDFBox. PDFBox libraries extract images ...
4
votes
0
answers
627
views
Extracting movie frames quickly in a 64-bit Cocoa app on Snow Leopard
I wrote a 64-bit Cocoa app on Snow Leopard that extracts individual frames
from a movie using the QuickTime API. It uses [QTMovie currentFrameImage],
which unfortunately is very slow, as was pointed ...
4
votes
3
answers
11k
views
Printing top few lines of a large JSON file in Python
I have a JSON file whose size is about 5GB. I neither know how the JSON file is structured nor the name of roots in the file. I'm not able to load the file in the local machine because of its size So, ...
3
votes
0
answers
150
views
Extracting multi agent RL patterns from AST representation
I have an AST representation of a multi agent reinforcement learning algo. I am trying to extract certain instances of code which tell us it is a multi agent representation. Some of them are:
I want ...
3
votes
1
answer
260
views
Accessing Name, Firstname... from a VObject in Python
I might be very stupid here, and my knowledge about Python is nearly at 0.
I get a vcard from my Mac Contacts - I can access it and transform most of the data. Simple outputs are easy. I lack when I ...
3
votes
0
answers
516
views
Tensorflow Object Detection and Image Cropping for Data Extraction
Below is my code that detect the regions (tables, paragraphs) from invoice and and crop the detected region from the invoice. I am facing issues while performing data extraction on cropped images as ...
3
votes
1
answer
251
views
Moving records from one database instance to another
I have a PostgreSQL database instance located in EU region. I plan on introducing another PostgreSQL database instance located in a new geographical region.
As part this work, I am to migrate data for ...
3
votes
0
answers
620
views
SQL Server linked server to a Microsoft Dataverse environment
I would like to connect from an on premise Microsoft SQL Server environment to a Dataverse environment in Azure. I want to be able to download data from Dataverse to SQL Server. I would like to know ...
3
votes
0
answers
1k
views
Extracting Headings/Chapters and related paragraphs separately from PDF file in Python 3.7
My task is to fetch chapter-wise content from pdf file separately so that i can store into database.
So far, i tried regex and tried to split but that only gives me chapter number but didn't help me ...
3
votes
1
answer
141
views
Is there a page limit for predicting using GCP AutoML for Custom Entity Extraction
I am trying to explore the custom entity extraction using GCP AutoML, I have a 10 page documents used for training the model, where my entities are trained those were 7 & 8 page as well.
While ...
3
votes
0
answers
2k
views
Extracting files with Japanese characters from a zip archive
Everything inside the zip file has a Japanese name (e.g. the directories inside, pdfs inside etc.). When I tried using
Archive::Zip
or
Archive::Extract,
it fails at a single point (Input/Output error) ...
3
votes
0
answers
2k
views
Extracting financial tables embedded in PDF document
I am trying to parse annual reports of companies, that are in pdf format (from annualreports.com). I have downloaded approximately 5000 pdf docs, each with text and table data. I wish to extract the ...
3
votes
0
answers
811
views
OpenAI - Algorithm: Implement the extract function to extract database by making calls to query
This is a question asked by OpenAI and submission date has passed away so finally I can ask this because I am very curious to know the answer.
Question:
You're on a website (such as Github!) with a ...
3
votes
0
answers
264
views
Pdf Text wrong character extraction
I have a pdf page with a formula as:
When text is extracted, few characters are wrong. Text looks like this:
/ToUnicode Object 33 0 R unfiltered stream looks like this:
Encoding looks like this:
...
3
votes
1
answer
633
views
sympy - how to additively combine exponents when expanding?
My main goal is to identify coefficients in front of exponentials of the form
exp(1j*k*r)
in expressions such as:
(z1*exp(1j*k1*r1) + z2*exp(1j*k2*r2) + c.c.)**2
by first expanding and then using ...
3
votes
0
answers
1k
views
Configure properties of boundingRect() in OpenCV (Python)
I have the following code which, given an image, it extract ROI from it.
Warning: as is, the code will save all extracted ROIs (45 files) on Desktop.
import cv2
extr_path = ('C:\\Users\\Bob\\...
3
votes
0
answers
196
views
Extracting non-object values from ecocrop in R
I'm working on crop-climate modelling using Robert Hijmans' package 'dismo' and more specifically the function 'ecocrop'. Ecocrop calls a series of 'hidden' functions including, .doEcocrop (ecocrop ...
3
votes
0
answers
943
views
Extracting the dominant colour in an image
I couldn't find a way to do this in R so thought I'd ask.
I'm trying to extract the dominant colour from a set of images (in this case, soccer jerseys) and having no luck.
The method yielding most ...
3
votes
1
answer
657
views
How to extract the main content from a webpage?
I am trying to write a summary of the content of a web page. For that I need to extract all the irrelevant text and data from a webpage.
I have used boilerpipe, but the text extraction is not good....
3
votes
1
answer
704
views
Copy a specific directory from Phar archive in PHP
All is in the header. My application is stored in Phar.
My workflow is really simple.
I work with PHP 5.6.
I do some stuff in my application and at a moment, I would like to copy a specific ...
3
votes
3
answers
27k
views
extract a date from a cell containing text string in excel
For the Twelve Months Ending Thursday, June 30, 2016
I have the above text in excel and I need an excel formula to extract ONLY the date June 30, 2016. The formulas I've researched were all for ...
3
votes
0
answers
477
views
Select dictionary elements in a Pandas series
I have sample data which looks like the following (these are two separate rows delimited by tabs):
Details
[{'name': 'Irrelevant_Data',
'parentName': 'Irrelevant_Scrape',
'parentId': '2662610',
...
3
votes
1
answer
2k
views
How to extract numbering and text from .docx file
How do I extract numbering and text from a .docx file using Java and the Apache POI XWPF library?
I am using the following code:
public static void readDocxFile() {
try {
File file = ...
3
votes
0
answers
292
views
Microdata Extraction with MicrodataExtractor of Any23
I am trying to understand microdata extraction with Any23, but can't find much of useful documentation.
Can anybody give me a simple example of how to use MicrodataExtractor to extract microdata ...
3
votes
0
answers
192
views
iMacros code to search and find a work in a page and then send a message if that word exists in that page
I am trying to write a code in Imacros. What I am trying to do is,see if a string of words exist in a page, if it does, not ,then go to next page. If it does, then send a message (I have working code ...
3
votes
1
answer
1k
views
FFMPEG: How to extract multichannel track from m4v, mix it down and save the stereo downmix as "left" and "right"?
just like the title already says: I want to extract a multichannel track (5.1) from an .m4v, mix this track down and save the output as separate files, so in the end I want to have something like '...
3
votes
0
answers
146
views
How can I utilize BLS.GOV API to extract data from their graphs to my website?
I'm very new to programming and using API. I am currently working on a project to take data from BLS.GOV using their API to port the data to my website! so far, BLS has given me this code:
$url = ...
3
votes
1
answer
3k
views
Use python to extract and plot data from netCDF
I am new to using python for scientific data so apologies in advance if anything is unclear. I have a netCDF4 file with multiple variables including latitude, longitude and density. I am trying to ...
3
votes
2
answers
2k
views
R-raster extraction along SpatialLine: relate extracted values to actual distance
When extracting values of a raster along a SpatialLine in R, how to relate these values to the actual distance along this line?
Suppose I want to extract the value of the R logo along the following ...
3
votes
0
answers
4k
views
What affects unzipping speed?
I do not necessarily have a problem, so this question is just out of curiosity: "What affects the extraction speed of a .rar/.zip/.tar file?"
I know some probable factors, such as RAM, HDD/SDD ...
3
votes
0
answers
3k
views
Extracting Folder from Resources to disk
I am trying to copy the Files of a Resources Folder to disk.
final URL url = TemplateUtils.class.getResource("/templates/");
LOG.info(url);
if (url != null) {
...